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PREFACE 


The physical sciences have made such advances in the last 
twenly or thirty years that it would be difficult to name a single 
branch of human knowledge that has not been affected significant- 
ly, or even radically, as a result. Modern engineering is to a large 
extent the creation of physics. New ideas and facts discovered by 
physicists become an integral part of man’s knowledge of the sur- 
rounding world. The problem of the structure of malter, whose 
investigation previously appeared as a peculiar form of mental 
exercise, has come to the forefront in science as a most important 
task that is inseparably linked with the development of civilisation. 

A broad knowledge of physics is a necessity for the specialist 
working im any branch of science or engineering if he desires to 
comprehend the fundamentals of his field of knowledge and is striv- 
ing to take a creative part in ils development. The task of a course 
in physics for students of a technical institule consists, therefore, in 
helping them to understand the physical basis of engineering. 

In addition to this main task, a course in physics in a technical 
institute should be organised in such a manner as to help the student 
to master experimental technique and acquaint him with equipment 
used to measure physical quantities. Skill in experimental physics 
is attained by working in the laboratory. It seems to us that famil- 
iavising oneself with experimental physics is a completely distinct 
task in the study of physics in technical institutes. The interweaving 
of experimental physics with the study of general physical laws 
and phenomena is only occasionally pedagogically justified. This 
is due to the fact that modern experimental physics cannot be 
sharply subdivided. The measurement of coefficients of expansion is 
accomplished with the aid of interferometry, radio equipment is 
required for experiments in mechanics and heat, and the investiga- 
tion of the structure of metals is inseparably linked with experiments 
in electricity. Physical experiments conducted with the aid of 
outmoded techniques are of interest only to specialists in physics 
desiring to trace the development of one or another experiment. It 
would probably be most expedient to arrange the curriculum in such 
a manner that laboratory work followed a course in general 
physics. 
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Thus, the author believes that lectures in physics, and conse- 
quently the corresponding textbook, should include only outlines 
of experiments, i.e., the goal of the experiments. 

Once agreed on the necessity for excluding experimental physics 
from our course, we must then choose between the inductive approach 
(from particular experimental facts to theoretical generalities) 
and the deductive approach (from theory to its experimental corro- 
boration and manifold applications). In a very extensive course, it 
is probably possible to combine these two approaches as they are 
linked in the development of science. This possibility was not open 
to the author and so the second approach was chosen. Presentation 
of the basic theoretical propositions, the deduction of corollaries 
that could be verified experimentally, and then the illustration of 
these experiments by means of diagrams—this was the approach 
adopted in practically every chapter of this book. Naturally, this 
meant that the historical method had to be completely disregarded. 
The history of the origin of ideas, the formulation and discard of 
physical theories, remained beyond the scope of this book, since it is 
written for the student who is not aiming to be a professional physi- 
cist. It seems to me that only such a method of presentation makes 
for clarity and conciseness. 

This approach is also dictated by the structure of the course. If 
the fundamental laws of physics are regarded as of paramount 
importance and we carefully consider the body of facts pertaining 
to a particular complex of theoretical ideas, then it would be expedi- 
ent to separate the presentation of the strictly phenomenological 
theories and their corollaries from the problems relating to the 
structure of matter. Classical mechanics, thermodynamics, statisti- 
cal physics and electrodynamics are self-contained and to a large 
extent in themselves complete fields of physics. Their presentation 
should precede the consideration of the problems relating to the 
structure of matter. It is expedient to consider the connection be- 
tween physical properties and the structure of matter after the funda- 
mentals of classical physics have been presented and after the 
theories of the structure of matter have been discussed. 

In many books on physics, problems related to the structure of 
matter and problems connecting the properties of matter with its 
structure are scattered through various parts of the course and to 
a large extent are lost, i.e., dispersed in other topics. It seems to 
me that this is not justified both from the point of view of the logi- 
cal structure of the course as well as from the viewpoint of the 
importance of these problems, especially if it is borne in mind that 
many institutions of higher education teach chemistry and technolo- 
gy, and specialise in matter and materials. The attention of the 
reader is called to the fact that problems related to the structure of 
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matter are discussed in a separate, concluding section. This is clearly 
the basic dilference, as regards presentation, between this book 
and others. 

Another distinguishing feature of this textbook on physics is 
the integrated presentation of problems in radio physics and optics. 
This helps to give the reader a better understanding of the essence 
of physical phenomena. 

In view of the relatively short time allotted to a course in physics, 
one is forced to seek methods of presentation that are as condensed 
and concise as possible. At Lhe same time, it is impermissible to treat 
superficially a number of problems in physics, the knowledge of 
which is a sine qua non today for every qualified chemist, technolo- 
gist’ and engineer. The reconciliation of these two contradictory 
requirements placed the author in a difficult situation. He decided, 
finally, to completely exclude repetition of all secondary-school 
material. However, this measure alone was insufficient to make 
room in this single volume for all the important problems of physics. 
To further reduce the size of the book, a concise presentation of the 
material was necessary. Some people feel that lectures and textbooks 
should include selected problems which are exhaustively discussed, 
and all that cannot be presented historically and logically, in 
complete detail should be excluded from the physics course. Such 
a viewpoint. it seems to me, is basically incorrect. A detailed knowl- 
edge of certain lopics is not essential for the future engineer. At 
the same time, however, he must have a general notion of these 
topics, if only to be aware of the existence of these phenomena and 
laws. Therefore, on various occasions only the theoretical con- 
clusions have been presented and our knowledge in a number of 
branches of science is merely summarised, i.e., a detailed descrip- 
tion of the manner in which this knowledge was obtained has not 
been given despite its interesting and educational nature. 

At first glance, it may appear that the author, overlooking the 
fact that mathematics and physics generally are taught concurrently, 
has not employed mathematics wisely. Thus, the definite integral 
Symbol appears in our book before the student has studied the 
integral calculus. However, an understanding of the text does not 
require that the student be able to integrate, but that he merely be 
familiar with the concept of the definite integral. It would be inadvis- 
able and cumbersome to avoid such formulas in a physics textbook, 
and it would result in an impoverished presentation of the physical 
concepls involved. A more profitable approach is to introduce some 
minor changes in the curriculum with respect to mathematics, so 
that the students are initiated into the concepts of mathematical 
analysis early in the course. This suffices for the student to be able 
to read books using these symbols. The process of integration may be 
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studied considerably after the relatively easy concept of the definite 
integral has been assimilated. 

The interconnection between the various branches of physics has 
its effect on the presentation of theory and experimental results. 
If the lecturer desires to maintain logical presentation in the course, 
he must inevilably resort to repetilion and cross-reference. There are 
innumerable examples of this. Thus, on the one hand, it is not 
desirable to separate the presentation of X-ray diffraction from 
optical diffraction. On the other hand, it is impermissible to disasso- 
ciate the latter problem from the problems of crystal structure. Or, 
as another example, it is difficult to treat problems of permittivity 
and dipole moments in separate sections, but at the same time il is 
not possible to discuss dipole moments without discussing other 
problems related to the structure of molecules. Furthermore, permit- 
tivity and coefficients of refraction must be discussed together. 
There is, of course, only one solution: in lectures, repetition is essen- 
tial (very useful, incidentally, for students); and in the textbook, 
recourse must be had to cross-reference. Owing to cross-referencing, 
a physics textbook cannot be read a single time in consecutive order. 
This also means that in order to properly understand one portion of 
the physics field, familiarity with the field as a whole is generally 
necessary. ‘ 


A. Kitaigorodsky 


PART ONE 
MECHANICAL AND THERMAL} MOTION 


CHAPTER I 


THE FUNDAMENTAL LAW OF MECHANICS 


1. Kinematics 


Equations of Motion of a Particle. If the dimensions and shape 
of a body are of no consequence in the consideration of a particular 
phenomenon, we can conceive of the body as being represented by 
a point. This approximate representation of a body by a material 


Zz 


Fig. 1 


(i.e., mass) point is not only justified when the dimensions of the 
body are small relative to other distances considered in the problem, 
but is permissible whenever we are only interested in the motion of 
the centre of mass of the body. 

In order to describe the motion of a particle, one must indicate 
through which points in space the particle has passed and the 
instants of time during which it was located at one or another 
point of the path. For this purpose, it is necessary, in the first place, 
to select a coordinate frame of reference (Fig. 1). The location of 

OF 
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a point in such a coordinate system, which in its simplest form 
is right-angled, is delermined by the three coordinales 27, y, 5. or 
by the so-called radius vector », drawn from the origin of the coordi- 
nate system to the given point* (Fig. 2). 

Thus, motion in space can be roughly described in the form of 
a table of values for » (each value being given by three quantities!) 
for the instants of time f,, f2, etc.; or accurately described in the 


A 


Fig. 2 Fig. 3 


form of a continuous function 1 = f(t) lin essence, three functions, 
eg., x = fit), y = fo(t), 2 = fa(t); or r= Q(t), @ = galt), B = 
= g(t); etc.] 

The vector equation » = f(t) or, what amounts to the same, the 
three equivalent scalar equations are called the equations of motion. 

Average Velocity. Let us consider AB, a portion of the path. 
Assume that at the instant of time ¢ the moving particle was at A, 
and at the instant of time ¢ + At at B (Fig. 3). Let us introduce the 
radius vectors #, and 1g. We know that during the interval of time 
At, the particle moved from A to B. It is therefore natural to call 


the vector AB the particle displacement vector. 
Vectors may he added by the parallelogram method. From 
Fig. 3, we see that 


= > 
Vp=?at+AB or AB=rg—P,= A?, 


* The radius vector is given by its magnitude, » = 2? + y? + =?, and 


. . m Lr J 
the angles it forms with the coordinate axes: cos =: — , cos p= and 


cos p=— . Thus, it is determined by three quantities: z, y and 2; or r, @ and 


B; or r, @ and y; etc. (two angles determine the third, since cos?a +- cos?—B + 
+ cos’y = 1). 
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i.c., the particle displacement vector is the vector difference of 
the radius vectors. The curvilinear motion is determined by the 
displacement vector Ay for time At, whereby the smaller Av, the 
greater the accuracy. 
The average speed for the path AB is given by the relation 
_ AB 
Yaw = TP 
This is the speed at which the body would have traversed the dis- 
tance AB in uniform and rectilinear motion during the interval of 
lime At. | 
Thus, motion over the path AB may be specified by giving 
> 
the direction of the'vector AB = Av and the speed v,,. In place 
of this, we introduce the vector 


which is equal in magnitude to the average speed and whose direc- 
tion is that of the displacement vector. We can now say that the 
motion of the body over the path AB is determined by the average 
velocity. ; 

Instantaneous Velocity. If we decrease the interval of time At, 
the point B will approach point A. These points finally merge and 


>> 
the direction of AB then coincides with the tangent to the curve at 
the point of merger. 


. AB ete 
As At decreases, the ratio ae approaches a limit. The vector 


Vinst, having the direction of the tangent to the curve at the 
given moment of motion and numerically equal to the limit of 


the ratio £6 as At > 0, is called the instantaneous particle velocity: 


Vinst = limit 52 when At — 0. 


In other words, the instantaneous velocity is the derivative of the 
vector # with respect to time: 


It should again be emphasised that it is not absolutely essential 
to employ vectors in order to describe motion. Instead of using 
the concept of vector velocity, we could speak of the absolute 
value of the velocity, [ent * and indicate the direction of motion 

* The vertical bars || indicate that only the absolute value (modulus) 
of the vector between the bars is being considered. 
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If we did this, however, the same rules and the same experimental 
facts would require more cumbersome and more wordy formula- 
tions. Vector notation corresponds to physical experience, and is 
moreover concise and expressive. A cerlain amount of effort, how- 
ever, is required lo become accustomed lo it. 

Since the projections of the vector # on the coordinale axes are 
the coordinates of its terminus, z, y and z, the projections of the 
velocity vector are: 


ee — id 2 
Bode ape Ae 


Acceleration. To continue our consideration of curvilinear motion, 
let us draw arrows to represent the instantaneous velocities of the 
body in passing through the points A and B of ils path. If we had 
not introduced the concept of velocity, we would have to describe 
the situation as follows: the speed at B is different from that at A; 
moreover, the direction of motion has changed. Using the concept 
of velocity, we can state more briefly: the velocity at B is differ- 
ent from that at A. 

Velocity can change in magnitude and direction. 

If the path AB is rectilinear, the vectors v, and vz, have the 
same direction. The change in velocity is obtained by arithmeti- 
cally subtracting the magnitude of the vector v, from the magni- 
tude of the vector vz 

Let us now consider the curvilinear path AB; vectors vy and vz 
differ in magnitude as well as 
in direction. To determine 
the increase in the magnitude 
of the velocity, it is neces- 
sary, as before, Lo subtract the 
magnitude of the vector vt, 
from the magnitude of the 
vector vy: 


A|v|=|e|—[eal 


However, this quantity does 

not, of course, completely 

express the change that has 
z Fig. 4 occurred in the motion. 

: Let us now subtract vector 

v, from vector v, in accord- 

ance with the laws for operating on vectors. Fig. 4 shows vector 


Av = Ug— U4. 
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Vector v,. the sum of Av--v,, is the diagonal of the parallelogram 
constructed on these vectors. 

Vector Av is called the velocity increment. The magnitude of 
this vector in the case of curvilinear motion is not A|v| = |vg| — 
—|v,|. From the figure, it is evident that the magnitude of the 
increment vector| Av|is greater than A] v|, the difference in the magni- 
tudes of the velocities. To determine the velocity at point B, one 
must add velocity v, and increment Av by the parallelogram method. 

We can now determine the acceleration for curvilinear motion 
as follows. The ratio of the velocity increment to the interval of 
time during which this increment takes place is called average 
acceleration: 

Av 


Cav = Tr . 


When the interval of time At is decreased, this ratio approaches 
a limit. The vector 


tinsi = limit 47 when At — 0 


is called the instantaneous acceleration of a body at a given moment 
of motion. In other words, acceleration is the derivative of velocity: 


du 
“dt 
and 


dvy dvy = dv, 
CS ae 3 Qy =H ? a, = dt 


The acceleration vector uniquely determines the nature of the 
change in the velocily of the body. 

Generally speaking, the acceleration vector can form any angle 
‘with the curve. This angle determines the nature of the acceleration 
and the curvature of the path as follows. Through the point of the 
curve that is being considered, a circle is drawn that has a common 
tangent with the path of motion at this point, and for the given 
portion of the curve most accurately approximates it. This circle 
is called a tangential circle * and its radius p is called the radius 
of curvature at the given point. The acceleration vector is always 
directed into this circle. If the motion is accelerated, the vector a 
forms an acute angle with the curve (i.e., with the tangent to the 
path at the given point). If the motion is retarded, this angle will 


* The tangential circle and the calculation of radius of curvature is 
studied in detail in courses on differential geometry. 
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be obtuse. Finally, if the magnitude of the velocity does not change, 
the acceleration vector is directed normal to the curve. 

These statements can be proved rigorously, but we shall merely 
illustrate them geometrically here (see Fig. 5). 

In line with the above discussion, it is customary to resolve the 
acceleration vector into two components (Fig. 6): 


a=A; +a. 
Since the vector triangle is right-angled, 
a=Vajit+ai. 


The vector @;, directed along the curve, represents the change in 
the magnitude of the velocity and is called the tangential acceler- 
ation. It is not difficult to 


y show that the tangential 
\aV % acceleration 
a, = limit Ale | when 
: d|v| 
At— 0, i.e., a= ee 


Y>y, Gay 

where A|v| is the increment 
in the magnitude of the ve- 
locity. 

The vector @,, directed 
normal to the curve, repre- 
sents the change in the direc- 
tion of the velocity and is 
called the normal accelera- 


Ye 
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tion. The normal acceleration a, is related by a simple for- 
mula to the speed v and the radius of curvature p at the given 
point, namely: 

re 
Kp——- 


From this formula, which is derived in courses in theoretical mechan- 
ics on the basis of geometrical considerations, it follows that motion 
with a constant normal acceleration (a, and v constant quantities) 
is circular molion. In this case, p is a constant quantity for all 
points along the path and is equal to the radius of the circle. 


UU 


y Peis; S 3 
The normal acceleration a, = FS is often also called centripetal 


acceleration. 
Centripetal acceleration of a body moving in a circle with radius 
R can also be expressed by means of the period 7, the frequency v, 
or the angular velocity @ of this motion. Between these quantities 
and the linear velocity v, the following simple relations exist: 
2aR 
T 


4 2m 
; v=oR, v= and o=z: 


The last two formulas define the auxiliary quantities v and o. 

Thus, the centripetal acceleration for the motion of a body in 
a circle can also be written in the form: 

Ax2 
Qn=OFR or dn=—pr KR. 

It should be emphasised that the everyday understanding of the 
word “acceleration” is much more limited than in physics. The con- 
cept of acceleration in physics includes retardation (negative accel- 
eration) and, what is most important, includes uniform motion if 
this motion is along a curved path. Only motion that is simulta- 
neously rectilinear and uniform is considered motion without 
acceleration. 


Examples of acceleration. A proton in a modern accelerator moves in @ 
circle with a normal acceleration of the order of 10'® m/sec?. The linear accel- 
eration of a modern rocket is ~30 m/sec?. The acceleration of a hockey ball 
is ~10 m/sec2. The initial acceleration of an automobile is 1-2 m/sec?. The 
angular velocity of the rotor of a turbogenecrator is 314 rads/sec and, at a dis- 
tance of 0.5 metre from the axis of rotation, particles move with an accelera- 
tion of ~5 X 104 m/sec?. The angular velocity of a bicycle wheel is 7- 
10 rads/sec and, at a radius of 0.5 metre, particles on the rim have a normal 
acceleration of about 20 m/sec?. 


2. Force 


At the present time, three types of interaction are known to the 
physicist. The laws of gravitational and electromagnetic forces 
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have been thoroughly investigated, and nuclear forces are being 
intensely studied. 

Gravitational Foree. The force of altraction belween heavenly 
bodies, which was discovered by Newton and is otherwise known 
as gravitational force, acts between any lwo particles in accordance 
with the law 


__ 4, IMying 
Pays 


1 sf P 
where VSaeX 10° dyne X cm?/gm?, m, and mz, are the masses of 


the particles, and r is the distance between them. 

It can be rigorously proved, but we shall not do so here, that 
Newton’s law of gravitation written in the form valid for bodies 
having small dimensions (small with respect to the distance between 
them) is also valid for the interaction of a small body with a large 
sphere. The distance, here, is understood to be measured between 
the centres of the bodies. 

.The law of universal gravitation for the case of the attraction 
of a body by the Earth can, therefore, be written in the form 


M 
f= ¥ TRE 


where fh is the height above the Earth’s surface and F is the radius 
of the Earth. For points close to the Earth’s surface, kh is so much 


smaller than R that # +h may be replaced by R. Then, F=y iam. 


Comparing this formula with the usual expession for weight, F = 
=mg, we see that the gravitational acceleration may be expressed. 
in terms of the gravitational constant, the mass, of the Earth, and 
the radius of the Earth: 
M 
B= Ye 


Since the gravitational force is proportional to the masses, it 
is very large for heavenly bodies and negligibly small for the ele- 
mentary particles. In the interaction between atoms, molecules and 
other particles of matter, the gravitational force is of no significance. 

The force of attraction between the Moon and the Earth is 
2.3-x 107° dynes, between the Earth and a molecule of oxygen 
~5 « 10-79 dyne, and between two oxygen molecules that are touching 
each other (3A = 3 X 10° cm), ~2 x 10-87 dyne. These figures 
speak for themselves. 

Electromagnetic Force. If two particles or bodies have electric 
charges g; and qo, there is a force of attraction between them if 
the charges are of opposite sign and a force of repulsion if the charges 
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are of equal sign. Quantitalively this relationship is expressed by 
’ dG : . : ° 
Coulomb's law: F= OR, As in the case of universal gravitation, 


this formula is valid for small particles. We shall show in 
Sec. 111 that magnetic force and electric force are intimately related. 
All electromagnetic interaction is of a single nature. 

The interaction between atoms, intermolecular forces, and the 
forces holding electrons about an atomic nucleus are all forces of 
electrical origin. In order to again demonstrate the negligible nature 
of the gravitational interaction between elementary particles, we 
compare gravilational attraction with the electric attraction 
between a hydrogen atom’s nucleus and its single electron: 


F=9X 10-8 dyne, while Fgrav= 4 1072 dyne! 


At first glance, ii may not appear understandable why the inter- 
action between neulral atoms and molecules is of electrical origin. 
We shall go into this in more detail in Chapter 29. However, we 
should note here that the forces between the atoms and molecules 
do not depend on the overall charge of the particles (which is equal 
to zero), but on the local concentration of electric charge. 

Since intermolecular force is of electrical origin, surface tension 
and all cohesion forces between bodies are of the same origin. 
Frictional force is also essentially based on electric interaction. 

The elastic force that is developed when rubber or a compressed 
metal spring is extended is due to interatomic and intermolecular 
interaction. 

Thus, it too, in the final analysis, is electromagnetic in na- 
ture. 

Nuclear Force. There are forces between neutral particles in an 
atomic nucleus (also between a neutron and a proton and between 
two protons) that cannot be explained on the basis of electromag- 
netism. These forces decrease very rapidly with increasing distance 
between interacting particles. As a result, these forces do no exist 
beyond the bounds of nuclei and are evident only in connection 
with phenomena involving direct interaction of nuclei. 

Force Field. The space in which gravitational force is effective 
is called a gravitational field. Similarly, we speak of an electro- 
magnetic field. Any particle acted on by a force field can also create 
such a field. Thus, every particle creates a gravitational field and 
is acted on by gravity; and every electrically charged particle 
creates an electromagnetic field and is acted on by an electromag- 
netic field. 

Thus, every interaction of particles is depicted in physics accord- 
ing to the scheme: particle—field—particle. The first particle 
creates a field, and this field acts on the second particle. 
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The field created by particles is material. The properties of 
a field are essentially different from the properties of substance. As 
a result, it is often said today that matter has two forms—lield and 
substance. The problems of the interrelation of field and substance 
are at present under intense investigation and cannot, as yel, be 
considered solved (see p. 555 for a more detailed discussion). 


3. The Fundamental Law of Mechanies 


Newton’s Laws. The fundamental law of mechanics is the rela- 
tionship found by Newton between the forces acling on a body and 
the acceleration acquired by the body under the action of these 
forces. This law is usually formulated for particles. This in no way 
limits the universality of the law, inasmuch as a complex hody 
can be considered, in principle, as the sum total of all ils particles. 
Moreover, Newton’s equation has extraordinarily broad direct 
application, since in most problems in mechanics we are either 
concerned with bodies having small dimensions or are interested 
only in the motion of the body’s centre of mass. 

The fundamental law of mechanics states the following. If the 


forces f;, f2, f3, etc., whose sum total is # = Ss act on a body, 
the acceleration acquired by the body is equal to the quotient ob- 
tained by dividing the resultant force by the mass of the particle: 


c=. 
m 


The equation also states that the acceleration vector coincides with 
the direction of the resultant force. The constant of proportionality 
in this formula is assumed to be equal to unity, which, the student 
will recall from his earlier training, depends on the choice of the 
system of units for the quantities entering into this equation. 
The fundamental law of mechanics may also be written in the form 


or Pa Sm) The latter equation is equivalent to the former only 
if the mass does not change during the motion. We shall adhere 
to this condition. The case of variable mass will be considered 
below. In Chapter 3, we shall briefly discuss the equation of motion 
for bodies of variable mass in the range typical for rockets and, 
in Chapter 24, we shall consider the complications arising when 
a body moves ‘with a speed approaching that of light (mechanics 
of the theory of relativity). 

The fundamental law of mechanics should be considered as a law 
generalising observed facts. This equation cannot be theoretically 
derived from any simple general considerations. 


=) 
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The law:of inertia follows directly from the fundamental law. 
If there are no forces acting on the body, the acceleration is equal 
to zero and the motion of the body is rectilinear and uniform. 

In applying Newton’s fundamental law to a particular body, we 
focus our atlention on this body and consider the forces acting on it. 
If should not be forgotten, however, that foree is a measure of the 
interaction between bodies and that one-sided interaction does 
not exist. If one body acts on another, the laiter also acts on the 
former. The measurement of force is equivalent to the measurement 
of interaction. Thus, the very method of measuring force assumes 
that the force of one body acting on another and the force exerted 
by the latter on the former are equivalent in magnitude. Since we 
are usually interested in one particular body, we focus our atten- 
tion on the force acting on it; the other force is called the force of 
counteraction or the force of reaction. The forces of action and 
reaclion are equal in magnitude but are oppositely directed. This 
proposition has become known as Newton’s third law of motion. 

Relativity of Motion. A body at rest in one system of coordinates 
may appear to us, from another viewpoint, to be moving. The 
uniform motion of a person walking along the platform of a station 
will appear nonuniform if described in a system of coordinates 
based on a braked train. Therefore, when speaking about the law of 
motion, the frame of reference for which this law holds must be 
indicated. The system for which Newton’s laws are valid must, 
without fail, satisfy the following conditions: a body on which no 
forces are acting must move rectilinearly and uniformly or must be 
at rest. Such a-system is called an inertial system. 

Thus, it is evident that all frames of reference that are executing 
accelerated motion with respect to a body on which no forces are 
acling are not inerlial systems. Another important conclusion that 
immediately follows is that there is not merely one inertial system. 
In fact, an infinite number of inertial systems exist. An inertial 
system can be based on any body moving uniformly and rectilinear- 
ly with respect lo some particular body on which no forces are act- 
ing. 

Let us assume that an inertial system has been selected. Newton’s 
law, I” = ma, is valid for any body moving in this system with 
a velocity v and acceleration a. Now, let us consider another frame 
of reference moving rectilinearly and uniformly with a velocity a with 
respect to the inertial system. To be sure, in this system, the same 
body will have a different velocity, equal to the difference between 
the velocity ¢ and the velocity a, the motion of the second system 
with respect to the first. However, since the relative motion of these 
two systems is rectilinear and uniform, the acceleration of the 
body will be the same in both systems. Expressed mathematically, 
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since acceleration is the derivative of velocity and the derivative of 
a constant quantity is equal to zero (=): 


dv d(v—uw) 
: We ae 

The acceleration of a body enters into Newton’s Jaw, but the 
velocity does not. As a result, the fundamental law of mechanics 
is exactly the same in both systems. 

This important proposition, following from Newton’s law of 
mechanics, is called the principle of the relativity of motion. It can 
be summarised as follows: An infinite number of inertial systems 
exist, and, in such systems, the law of inertia and the # = ma 
law are satisfied. In this respect, none of these systems has any 
special advantage over the other systems. All inertial systems are | 
equally suitable for the description of physical phenomena. 

The principle of relativity was first formulated by Galileo. 

Laws of Mechanics in a Noninertial System of Coordinates. Let us 
assume that the statement “acceleration is due to forces” is always 
valid in every system of coordinates. In noninertial systems of 
coordinates, a body executes accelerated motion even when il is not 
interacting with other bodies. But if this'is so, noninertial systems 
possess, in addition to forces due to interaclion, forces of different 
origin, i.e., forces resulting from the noninertial character of the 
system. These additional forces are called inertial forces (although 
it would actually be more correct to call them noninertial forces). 
Since inertial forces do not result from interaction, they do not 
satisfy Newton’s third law of motion. 

We shall limit ourselves to a simple example of inertial force, 
for in this book we do not intend to employ noninertial systems 
of coordinates in the analysis of motion. 

Let us assume that, for certain reasons, it is convenient to select 
a system of coordinates moving with an acceleration a, having 
a constant magnitude and direction. All bodies at rest or moving 
uniformly with respect to inertial systems will move with an accel- 
eration —a@ in relation to the noninertial system selected. The 
acceleration —a is produced by a force —mda. 

This is the inertial force for the case under consideration. It is 
not the result of the interaction of bodies, but is due to the accelerat- 
ed motion of the reference system. 

If the body under consideration in the noninertial reference 
system interacts with other bodies, the inertial force is added to 
the forces due to interaction. 

The fundamental law of mechanics in noninertial systems of 
coordinates is wrillen in the form: 

ma =F + inertial forces, 
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where # is the resultant force due to the interaction of the 
bodies. 

The expression for the inertial forces will vary in accordance 
with the nature of the motion of the noninertial reference system 
(rectilinear, circular, circular with accelerated speed, etc.).. Formu- 
las for inertial forces in a variely of cases can be found in books 
on theoretical physics. 


4. Application of the Fundamental Law of 
Mechanics to Accelerated Rectilinear Motion 


In this section, we give several elementary examples illustrating 
the physical meaning of the fundamental law of mechanics, which 
states that the vector sum of the forces acting on a body is equal to 
the product of the mass of the body and the acceleration, and its 
direction is that of the acceleration. 

Horizontal Motion Under the Action of a Constant Foree. An 
engine moves a trolley located on rails. Two forces act on the 
trolley in opposite directions—2’,;, the frictional force exerted by 
the rails, and #,, the force exerted by the engine. If these two 
forces are equal, the trolley moves uniformly. In order for the 
trolley to accelerate, the resultant force must be directed parallel 
to a. Therefore, to produce accelerated motion, the motive force 
must be greater than the frictional force. Moreover, the difference 
between these forces is the resultant force, which according to the 
fundamental law of mechanics is equal to the product of the mass 
and the acceleration. Thus, 


FF. — Fy, = ma. 


The frictional force is the result of the interaction of the rails 
with the trolley. Therefore, coupled with F,, is the force exerted 
on the rails (F;,). Similarly, coupled with Fe; is Fye, the force which 
the trolley exerts on the engine. 

The force F;, is the resistance force overcome by the engine 
(experienced by and acting on the latter). This is the force that 
would act on a man’s muscles if he were the source of motive 
power. As can be seen, the resistance force Fy consists of two terms: 
the frictional force and the quantity—ma, which can be called the 
inertial resistance. Inertial resistance is always related to the effective 
force acting on the accelerated body. It is equal to ma in magnitude 
but is oppositely directed to it. The inertial resistance could also 
be a single force acting on the accelerated body, which would be the 
case here if there were no friction. 

Let ug consider another example of horizontal motion under 
the action of a constant force. The load under consideration is placed 
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on a moving trolley having a back stop (Fig. 7). If there were no 
back stop, the load could slip off the trolley when the motion is 
accelerated. With no back stop, the fate of the load depends on the 
interaction between the trolley’s floor and the load. This interaction 
involves only friction. The trolley moves with a small acceleration @ 
and the force acting on the load, i.e., the frictional force, should be 
equal to ma*. But the static frictional force cannot have any magni- 


tude ‘whatsoever: It must be somewhat less than F7,"*. If 


ma > FF", 
motion with acceleration @ becomes impossible and the load slides 
off the trolley. If there were no friction between the load and the 


Fig. 7 


floor of the trolley, the load would not move from its place, i.e., 
the trolley would move out from under the load. Let us now assume 
that the trolley has a back stop. The load is then prevented from 
sliding as soon as it comes into contact with the back stop. The 
back stop now pulls the load with the force # = ma. The force 
coupled with the motive force is the inertial resistance experienced 
by the back stop. It is also equal to ma, but is directed oppositely 
to the acceleration and acts on the back stop. 


Examples of force. The force accelerating a passenger car is ~200 kg = 
= 1,960 newtons (1 newton is the force imparting an acceleration of | metre/sec? 
to a mass of 1 kg; 1 newton = 108 dynes = 0.102 kg). The thrust developed 
by the jet engine of a modern aircraft is 10,000-20,000 kg = 10°-2 x 10° new- 
tons and the tractive force developed by a T3-3 diesel locomotive is ~10,000 kg. 


Vertical Motion of an Elevator. Let us consider the forces acting 


on a load located on the floor of an elevator executing nonuniform 
motion. 


* Jn accelerated motion, if some body is carried along merely because of 
friction (the body carried along is at rest with respect to the carrier), the stat- 
ic frictional force will always have the direction of the acceleration. 
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Assume the elevator is accelerated upward (Fig. 8). Two forces 
acl on the load: the Earth’s force, Fg,, and the force exerted by the 
elevator’s floor, F.,;. But now the resultant force must be different 
from zero, so Fy, + F... Since the resultant force will be in the 
direction of the acceleration, Fe; > Fy, and 


Fa—Fr =m. 


The force F,, is simply the gravitational 
force exerted by the Earth on the load. Thus, 


F.y—mg =ma. 


The magnitude of the force exerted by the 
load on the elevator, F7¢, is exactly the same 
as Fo; thus, the resistance experienced 
by the elevator in lifting the load is 


Fey =mg + ma. Centre of gravity 


We see that this resistance consists of the 
weight of the load and the inertial resist- 
ance. The force Frye is sometimes called the 
apparent weight. 

The result obtained is for the case when 
the acceleration of the elevator is directed 
oppositely to that of gravity. This condi- Fig. 8 
tion is satisfied not only when the eleva- 
tor is being accelerated upward, but also when it is decelerated 
during its downward motion. 

When the direction of the gravitational force and the acceleration 
of the elevator coincide, the force exerted by the load on the ele- 
vator (apparent weight) is 


Fi,e= mg — ima. 


From this formula, it is evident that the pressure against the floor 
of the elevator ceases when a = g, i.e., when the elevator falls 
freely in the gravitational field. In this case, the body in the falling 
elevator ceases to press against the floor and stretch the cable, 
i.e., the body, so to speak, ceases to have weight. 

Force on a Freely Hanging Load. Let us consider the motion of 
a plumb bob suspended from a trolley executing accelerated motion. 
For such motion, the string by which the plumb bob is suspended 
forms an angle with the vertical. Two forces act on the load: #.,, 
the tension on, the string, and I’m, the Earth’s attraction, which 
is equal to mg (Fig. 9). These forces are directed at an angle to 
each other. According to the fundamental law of mechanics, their 


3~1409 
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vector sum is equal to ma and its direction is that of the accelera- 
tion. The diagonal of the parallelogram formed by the forces #5: 
and Fg; is, therefore, horizontal: 


ma= Fy. + Fr. 


The force coupled with Fg, is exerted on the Earth and does not 
interest us. We are, however, interested in the force Ij,, i.e., the 
force with which the load 
stretches the string. The value 
of this force, exerted on the 
string, is: 

FF), = —ma+ Fer. 


Thus, in this example too, 
the inertial resistance is a com- 
ponent part of the total re- 
sistance experienced by the 
accelerated body. 


5. Application of the 
Fundamental Law 
of Mechanics to Circular 
Motion 


Motion in a circle is accel- 
erated motion. If a body 
moves in a circle with constant 

Fig. 9 angular velocity, the magni- 
tude of its acceleration is equal 
to w*R and its direction is inward along the radius. 

While executing uniform motion in a circle, the body may be 
under the action of any number of arbitrarily directed forces. 
However, in accordance with the fundamental law of mechanics, 
the vector sum of these forces, or, simply, the resultant force, must 
be directed inwardly along the radius (parallel to the acceleration) 
and the value of its magnitude must be 


mv2 
Feentrip = x moh. 


The resultant force acting on the uniformly rotating body is called 
the centripetal force. We again emphasise that the resultant force 
always has the direction of the acceleration and not of the velocity, 
i.e., in our case, the force producing uniform circular motion is 
directed along the radius toward the centre of the circle and not 
along the tangent to the circular path. The role of the centripetal 
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force is to continually deflect the body from the rectilinear path 
along which it would move, as the result of inertia, if this force 
were not present. 

Example. A particle of mass m caught on a blade of a modern steam turbine 
(3,000 rpm, radius about 1 metre) experiences a centripetal force F = mo*r = 
=m X (314)? X 100 = m X 107 dynes (where m is in grams). The weight 


m xX 10% 
m 


of the particle is equal to mg. Thus, the centripetal force is , or about 


10,000 times, the weight of the particle. 


If a body is given accelerated motion, then, in conformity with 
the law of action and reaction, the accelerated body should act on 
other bodies (constraints) that make it 
accelerate rather than move in accordance 
with the law of inertia. The force of the 
accelerated body on the constraint has 


been called the inertial resistance. Such YY 
a force also exists, of course, for circular Y Uf 
ra ff, 


motion—it is called the centrifugal force. a 
Centrifugal force and centripetal force = geet tar! 

are equal in magnitude but are opposite- ni oom 

ly directed. The centrifugal force is 

applied to the constraints of a body exe- l 

cuting circular motion or, in other words, 

is applied to those bodies making the Fig. 10 

body under consideration move in acircle, 

and preventing it from moving rectilinearly and uniformly. As in 

the case of centripetal force, centrifugal force is a resultant—the 

sum of all the reactions exerted by a rotating body on its con- 

straints. 

Let us consider several examples, limiting ourselves to the simple 
case of circular motion due to the interaction of two bodies. If 
body A prevents body B from moving rectilinearly and uniformly, 
and makes it move uniformly in a circle, Fy, is the centripetal force 
and Fy, is the centrifugal force. Such simple interaction occurs 
between a body located on a bowlshaped pedestal, rotating about its 
axis in the horizontal plane, and the pedestal itself (Fig. 410). If 
the frictional force is not very large and the pedestal is rotating 
rapidly, the body slides to the wall of the bowl. In this case, the 
interaction between the body and the pedestal consists in the 
following: the wall of the bowl acts on the body inwardly along 
the radius (centripetal force), and the body, with a force of equal 
magnitude, presses against the wall outwardly along the radius 
(centrifugal force). 

We return now to the initial moment in this experiment. The 
body is lying on the pedestal and the pedestal has just begun to 
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rotate. If there were no interaction between the body and the 
pedestal, the body would remain in place and the pedestal would 
rotate under the body. The presence of static friction prevents 
this from happening. The body rotates together with the pedestal. 
Moreover, as was indicated in the previous section, the static fric- 
tional force will be directed inwardly along the radius. The static 
frictional force is the only force impelling the body to rotate, i.e., the 
frictional force in this 
case is acentripetal force. 
Therefore, 


F py a. Feentrip+ 


The centrifugal force is 
exerted by the body on 
the pedestal and is thus 
directed outwardly along 
the radius. If for the 
: purpose of clarity (it 
Fig. {4 should be remembered, 
however, that this is a 
very gross picture) friction is conceived as being due to the engage- 
ment of two rough surfaces, whereby the surface protuberances 
of the first body mesh with those of the second, then the cen- 
trifugal force constitutes a force acting outwardly along the radius 
at the meshed points of the pedestal surface. 
The frictional interaction maintaining the body fixed with 


respect to the pedestal must be less than a certain maximum /7},"*. 
In increasing the velocity of rotation of the bowl, the vale of 


mw*R finally becomes greater than F7;"", which makes it impossible 
for the body to execute circular motion with acceleration | a | = 
= w?R. Indeed, to secure circular motion with angular velocity o, 
a force mw?R must act on the body. If the frictional interaction cannot 
provide this force and, hence, motion in a circle of radius R with 
angular velocity a, the body moves with respect to the pedestal 
and static frictional interaction ceases to exist between the body 
and the pedestal. 

As soon as interaction between the body and the pedestal ceases 
and the body becomes free, rectilinear and uniform motion begins, 
the velocity being that possessed by the body at the moment of 
dissociation. Since the velocity of a body moving in a circle is 
directed along the tangent, this line represents the line of motion of 
the freely moving body. The tangential path of particles dissociated 
from a rotating body is very clearly demonstrated in the case of 
particles flying from a rotating grindstone. 


5. Circular Motion 37 


Let us now consider the rotation of a stone tied to the end of 
a string (Fig. 11). In order to uniformly rotate the stone at the end 
of the string under normal conditions, tangential acceleration as 
well as centripetal acceleration must be imparted to the body. The 
tangential acceleration is necessary in order to overcome the fric- 
tion with the air. The resultant acceleration—and, hence, the 
force—is not directed along the radius, but forms an acute angle 
with the direction of motion. The hand executes rotational motion 
and the string is directed at 
each instant along the tan- 
gent to the circle described 
by the hand. 

As another example of 
circular motion, let us con- 
sider the rotation of two 
attracting bodies having the 
same angular velocity about 
a common centre. By means 
of a centrifugal machine, it 
is not difficult to make two 
bodies of equal mass, 
joined by a string, revolve 
about a common axis. ; 

To begin with, let us con- Fig. 12 
sider the first body, with a 
string attached to the rotating shaft. The centrifugal force acting 
on the shaft is equal to m,o°R,. Similarly, the second body acts 
on the shaft with aforce m2w*Re. If these forces are equal, the strings 
could be joined to each other as shown in Fig. 12, for nothing 
would change thereby. It is thus clear that the condition for stable 
rotational motion of two bodies joined by means of a string is the 
equality of the centrifugal forces exerted on the string by these 
bodies: 


m wr R, => mow? Rs. 
Thus, . 
i ie, 
mo =a Ry : 
i.e., stable rotation takes place only when the ratio of the distances 
to the axis of rotation is inversely proportional to the masses of 
the bodies. 
The point dividing the distance R,+ R2 in the ratio Fe = ~ 
5 | 
(Fig. 12) is called the centre of mass (see Sec. 15). It can be stated 
that stable rotation of two joined bodies takes place about the 
centre of mass of the system. 
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We have spoken about two bodies whose interaction is achieved 
by means of a string. However, the above is also completely valid 
when two bodies attract each other in accordance with the law of 
universal gravitation, or when a positive anda negative charge 
attract each other. Thus, interaction of any kind between two 
attracting bodies can produce stable rotation about the centre of 
mass of the system. This interaction is given by two forces applied 
to the attracting bodies. The forces are oppositely directed but 
numerically equal. (At this point, the unsophisticated reader will 
usually ask: Why do the bodies not attract each other? We repeat: 
The forces are parallel to the accelerations, not to the velocities, 
and in circular motion the acceleration is directed along the radius 
toward the centre of rotation.) Since a single force acts on each 
body, both are centripetal forces. At the same-time, both are also 
centrifugal forces. Thus, body A acts as a constraint for body B, 
and vice versa. In other words, for body A, Fx, is a centripetal 
force while F4, is a centrifugal force, and vice versa for body B. 
However, the concept of centrifugal force is used here in a complete- 
ly formal sense. It was introduced only in order to emphasise the 
similarity existing between a system of spheres joined by a string 
and a system of bodies “joined” by a force of attraction. 

A planetary system is an example of stable rotation of attracting 
bodies. Let us assume that the Sun had only one planet—the Earth. 
The centre of rotation would then divide the line joining. the Sun 
and the Earth in the ratio mgun : Mparth = 330,000 : 1. 

Thus, when it is ordinarily said that the Earth rotates about the 
Sun, we are not committing a serious error, and this would be so 
even if the Earth were the Sun’s only planet. 
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The motion of the Earth is complex. It revolves about its axis 
and, at the same time, moves in an orbit about the Sun. Hence, it is 
clear that the Earth does not constitute an inertial frame of ref- 
erence. Nevertheless, under conditions prevailing on the Iarth, 
Newton’s law is generally quite satisfactory. In a number of cases, 
however, the noninertial property of the Earth’s frame of reference 
has an appreciable effect on the phenomena being studied. These 
cases should be investigated. 

Effect of the Earth’s Rotation on Its Form. Weight of a Body. 
If the Earth’s rotation is not taken into consideration, a body lying 
on the surface of the Earth can be considered at rest. The sum of 
the forces acting on this body would then be equal to zero. As a mat- 
ter of fact, any particle on the Earth’s surface lying at latitude 
moves with an angular velocity wm = 0.7292 x 10-* sec"! about 
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the globe’s axis, i.e., in a circle of radius r= R cos g (R is the 
radius of the Earth, which is assumed to have, to a first approxima- 
tion, the shape of a sphere). Therefore, the sum of the forces acting 
on such a particle differs from zero. It is equal to the product of the 
mass and the acceleration, w*R cos g, and is directed along r. 

It is clear that the presence of such a resultant force OG 
(Fig. 13) is possible only when the reaction of the Earth’s surface 
OA and thegravitational force 
OE are directed at an an- 
gle to each other. The body 
will then press on the Earth’s 
surface (according to New- 
ton’s third law) with a force 
OC = —OA. If the globe 
were at rest, this force would 
be equal to the gravitational 
force OE and would also 
coincide with its direction. 

Let us resolve the force OC 
into two components — one 
force directed along the ra- 
dius OD and the other along 
the tangent OB. As can be 
seen from the figure, the 
Earth’s rotation results in 
two effects. First, the weight 
(the body’s pressure on the 
Earth) becomes less than the 
gravitational force. Since OC ~ OD, this decrease equals DE = 
=mRw? cos’p. Secondly, a force is produced tending to flatten 
the Earth, i.e., shift matter toward the equator. This force 
OB =mRa’* cos sing. Such flattening has actually taken 
place, for the Earth’s shape is not spherical but close to an ellipsoid 
of revolution. As a result of this effect, the equatorial radius of the 
Earth is 1/300 greater than the polar radius. 

The flattening force tended to redistribute the mass of the globe 
as long as the latter’s form was not in a state of equilibrium. When 
this process was completed, the flattening force evidently ceased 
to be effective. Hence, the force exerted on the terrestrial “globe” 
is directed normal to the surface. 

Let us now return to the quantity expressing the pressure of the 
body on the Earth, i.e., to the physical quantity generally called 
weight. The calculation made for a sphere (gravitational force minus 
mRw? cos* g) is naturally not valid for the actual shape of the Earth. 
However, for approximate calculations, this value can be used. 
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At the poles (p = 90°), the weight of the body is equal to the 
gravitational force. Let us denote the gravitational force of the 
body at the poles by mg. As indicated above, the pressure exerted 
by the body on the Earth’s surface at any point of the globe, in 
other words, the weight of the body, will be equal to the difference 
between the gravitational force and the force DE, i.e., 


mg — mR? cos? ~ = mg’. 
Thus, 
g’ = g— Rw? cos? 2 


is the acceleration with which a body falls at the latitude @. At the 
equator, g’ is 1/300 less than g. 
If we use an appropriate value for the acceleration of a freely 
falling body at each of the various latitudes, we do not have to 
calculate the effect of the 
Earth’s rotation on the weight 
of the body. 
Effect of the Earth’s Rota- 
@) tion on the Motion of a Body 
on the Earth’s Surface. Let us 
assume that the motion ofa 
body is observed in a rotating 
system of coordinates. The 
body moves rectilinearly and 
uniformly past the observer, 
and the motion is curvilinear 


V in the selected noninertial 
frame of reference. Coriolis, 
the French scientist, showed 

cor by means of calculations that 


relative to a system rotating 
with angular velocity w a 
body moving rectilinearly and 
uniformly with velocity vhasan 
acceleration equal to 2vo sin a, 
where a is the angle between the axis of rotation and the direction 
of the rectilinear motion. The acceleration is directed perpendicular 
to the plane passing through the axis of rotation and the direction 
of the velocity. We may use the following rule to determine which 
of the two possible directions is that of the acceleration. If one looks 
along the axis of rotation in the direction that makes the rotation 
appear counterclockwise and places his left hand palm down with 
the fingers pointing in the direction of the rectilinear motion, the 
thumb will point in the direction of the acceleration (Fig. 14). 


Fig. 14 
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The Coriolis acceleration @c,,; acts on all bodies moving on the 
Earth’s surface. If one looks along the Earth’s axis from the North 
Pole, the rotation appears counterclockwise. Hence, in the Northern 
Hemisphere, any body moving rectilinearly relative to an inertial 
system will deviate to the right (as viewed by a terrestrial observer) 
in the course of its motion, while in the Southern Hemisphere, it 
will deviate to the left. This deviation could be large or small, 
depending on the direction of motion with respect to the axis and 
on the linear velocity of the motion. 

The deviation of the body can take place in the horizontal or in 
the vertical plane (with respect to the surface of the Earth). The 
Coriolis acceleration is directed perpendicular to the Earth’s axis; 
hence, the deviation taking place in the horizontal plane is greatest 
at the poles and equal to zero at the equator. The reverse is true 
for deviations in the vertical plane. The deviations in these two 
planes determine the corresponding projections of the acceleration 
vector. Thus, the projection of the acceleration of the body in the 
horizontal plane is 

2vo sin @, 


where q is the latitude. In the Northern Hemisphere, this projec- 
tion is directed to the right of the motion. 

The deviation of bodies moving in the horizontal plane from 
their rectilinear path is the reason why the right banks of rivers 
are eroded in the Northern Hemisphere, and the left banks in the 
Southern Hemisphere. For the same reason, rivers in the Northern 
Hemisphere by-pass obstacles to the right and in the Southern 
Hemisphere to the left. 

Air masses flowing into regions of low pressure deviate from the 
radial direction to the right in the Northern Hemisphere (to the 
left, in the Southern Hemisphere) and form cyclones. Thus, cyclones 
in the Northern Hemisphere stir the air masses counterclockwise, 
and in the Southern Hemisphere clockwise. 

As a result of the vertical deviation, a falling body does not 
fall exactly vertically. Such a body deviates from east to west (the 
Earth rotates from west to east, i.e., counterclockwise if viewed 
from the North Pole). 


Examples. 1. Let us calculate the maximum deviation of a normal artillery 
shell. The deviation will be a maximum at the poles (p = 90° and for all firing 
directions ~ = 90°). If we take the velocity of the shell to be 1 km/sec, we 
obtain a deviation of 2 X 1,000 x 0.73 x 1074 = 0.15 m/sec?. The gravi- 
tational acceleration is about 70 times greater than this acceleration. As can 
be seen, the deviation of the shell from its rectilinear path can attain a magni- 
tude of the order of several centimetres per second. 

2. Assume that a river is flowing from north to south (in the Northern 
Hemisphere) with a velocity v = 3 km/hr. Thus, the water moves from a region 
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of small linear velocity of rotation of the Earth's surface to a region of larger 
linear velocity. This increase in the velocity of motion (directed from west to 
east, together with the banks of the river) is determined by the Coriolis accel- 
eration and is due to the action of the right bank of the river on the mass of 
water. Let us calculate the Coriolis acceleration for the latitude pm = 45°: 
2Cor = 2UWq Sin |, 
Wo = 2x rads/day =7.25 10-5 rad/sec, v=3 km/hr=0.83 m/sec, 
@Cor = 2X83 X7.25 x 10-5 K0.707 = 850 10-5 = 8.5x10-3 cm/sec?. 
Thus, on every ton of water the right bank exerts a force of 
8.5x 10-3 
980 


The steep right banks of the Volga, Don and other big rivers of the Northern 
Hemisphere illustrate this effect. 


x10 gm =8.7 gm. 


7. Data Necessary for the Solution of Problems in Mechanics 


Tihe basic problem of mechanics is the determination of the motion 
for given forces. To determine the motion means to be able to 
indicate the location in space and the corresponding instant of time 
for any of the particles. If we are concerned with a complex mechan- 
ical system, then such data are necessary for each of the particles 
into which this system can be considered divided. 

In order to tackle such a problem, we must, in the first place, 
have complete data on the effective forces. The forces must be known 
for every particle and for every location of this particle. If these 
forces are known, then by means of Newton’s equations we can 
determine the acceleration of the particle. However, Newton’s equa- 
tions of motion alone are insufficient to completely determine the 
path, the velocity and the instant of time corresponding to the 
passage through a given point in space. To describe the motion, 
it is necessary to know for each instant of time the location of the 
particle and the magnitude and direction of the velocity. In all, 
six quantities must be given: three coordinates and the three projec- 
tions of the velocity on the axes. These data uniquely describe the 
“mechanical state” of a particle and may be called the parameters 
of state. 

Thus, the problem reduces to the determination of the parame- 
ters of state, for Newton’s equations only give the acceleration. 

To solve the problem, the initial conditions must be known, i.e., 
the values of the parameters of state for some instant of time (this 
instant is usually designated by t = 0, whence the designation “ini- 
tial conditions”). If the initial values of the parameters of state 
are known, the rest is merely a matter of mathematics. Newton’s 
equations of motion plus the initial data suffice to uniquely solve 
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the mechanical problem. In principle, the future motion of the 
particle as well as the past motion can be established for any desired 
period of time. This concept amazed scientists at one time. Laplace, 
the great French scientist and thinker, once said: If we knew 
the initial coordinates and velocities of all the particles comprising 
the world, we would be able to predict the fate of the world. This 
somewhat naive viewpoint, reducing all reality to purely mechanical 
phenomena, is not valid in principle, and not merely because it is 
practically impossible to obtain the required data. Mechanics, based 
on Newton's laws, has limited application and its conclusions cannot 
be applied that broadly. 

Let us return, however, to the six initial conditions. The need 
for giving precisely six quantities for a particle is evident from 
Newton’s equations themselves. 

The vector equation can be resolved into its components and 
written in the form of three equations: ma, = Fy, ma, = F, and 
ma, = F,. To determine the motion, it is necessary to establish how 
the particle’s three coordinates z, y, 2 vary with time. In order to 
establish the dependence of the coordinate z on time, we must 
integrate the equation 

dvx 


m —; = Fi 


The first integration enables us to find the z-component of the 
velocity. Upon integrating, we obtain the first constant of integra- 
tion. The second integration enables us to find the coordinate z as 
a function of time, and the second arbitrary constant is obtained. 
The above also holds true for the equations of change with respect 
to time for the other two coordinates. In all, six arbitrary constants 
are obtained. These may be determined only if six independent facts 
about the coordinates and velocities of the particle are known. 

As we have indicated, the initial conditions consist of the three 
initial coordinates and the three projections of the initial velocity. 
However, the problem could also be solved if six other quantities 
are known. For example, we may be given the three coordinates 
of the initial point, the numerical value of the initial velocity, 
and two coordinates of the final point. The path of the particle 
is also uniquely determined by these six conditions. 

The parameters of the particle may be given in a variety of 
ways. The location of the particle in space may be given by three 
Cartesian coordinates or by the distance from the origin of the coor- 
dinate system and two angles formed by the radius vector with 
the axes. Similarly for the velocity. 

A typical example of the dependence of a body’s motion on the 
initial conditions is the behaviour of a rocket fired from the sur- 
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face of the Earth. The trajectory of the rocket and its destiny is 
determined by the firing direction, the geographical Iocation of the 
launching site and the magnitude of the initial velocity. As is well 
known, for small firing velocities from the Earth, a body has a para- 
bolic trajectory. For a velocity of about 8 km/sec, equilibrium 
is achieved between the centrifugal force and the gravitational 
force, and the launched body may be placed in a circular orbit. 
For velocities between 8 and 11.2 km/sec, the launched body moves 
in an elliptical orbit about the Earth. At an initial velocity of 
about 11.2 km/sec, the kinetic energy of the body becomes sufficient 
to completely overcome the Earth’s gravitational attraction. 
A rocket launched with such a velovity will have a hyperbolic 
trajectory. 

If the mechanical system consists of n independent points, the 
number of parameters for the system will be equal to 6n. 

In some cases, however, constraints which serve to decrease this 
number may be placed on the mechanical system. A simple example 
is a centrifugal regulator, which may be considered as a system 
consisting of two joined spheres that can slide apart and turn about 
a common axis. It is clear that, given the distance of a point from 
the axis of rotation and the azimuthal angle with respect to an 
arbitrary line, we can uniquely determine the mechanical state of 
the system. Two “coordinates” and two velocities of change of these 
coordinates constitute the parameters of this state. 

Let us now consider an arbitrarily rotating solid body and deter- 
mine the data required to fix its position with respect to a station- 
ary system of coordinates. It is clear that the centre of mass of 
the body is determined by three quantities. To describe the body’s 
rotation, three angles suffice. We need not elaborate on this point, 
for it is evident that by means of three rotations about mutually 
perpendicular axes any desired orientation of a body can be achieved. 

Thus, the solid body requires twelve parameters—six coordinates 
and six velocities of change of these coordinates. 

As another example, let us consider two rigidly joined points. 
If they were free, six coordinates would be required to describe 
them. Since they are rigidly joined, an additional condition relating 
the coordinates of these points exists, namely: 


(X1— £2)? + (Ys — Yo)? + (24 — 22)? = Const. 


Thus, five independent quantities are required to describe this system. 
In all, there are ten parameters—{five coordinates and five veloci- 
ties of change of these coordinates. 

Since the parameters of state are always equally divided between 
“coordinates” and velocities of change of the “coordinates”, it is 
customary to speak of the degrees of freedom of a system, whereby 
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we mean the number of independent coordinates required to describe 
the system. Thus, one point has three degrees of freedom, two 
rigidly joined points have five degrees of freedom, a solid body— 
six degrees of freedom, a system consisting of nm independent points— 
3n deerees of freedom, etc. The meaning of the following proposition 
should now be clear: The mechanical state of a system is described 
by giving its parameters in terms of the number of degrees of freedom. 


8. Constants of Proportionality and Dimensions 
of Physical Quantities 


The coefficient y in the expression for the law of universal gravi- 
tation is a universal constant depending on the choice of units for 
force, mass and distance. It is possible to choose the units in such 
a manner that y = 1. This would require that the unit of mass be 
equal to the mass of a particle attracting a similar mass at unit 
distance with unit force. In the CGS system of units, such a mass 
would be equal to 1.5 x 10? gm, i.e., 15 tons. 

Thus, universal constants in formulas of physics depend on the 
specific choice of units. If we desired, we could eliminate all such 
constants from formulas of physics by appropriately choosing the 
units. 

It is important to grasp the concept that the employed system 
of units and the constants of proportionality in formulas are inter- 
connected. We can demonstrate this interconnection by dimensional 
formulas. First, the number of units that we wish to consider funda- 
mental must be established. This number depends entirely on us 
and is determined exclusively by considerations of convenience. 

A widely used system of units in physics is based on the units 
of length (ZL), mass (AZ) and time (7) as the independent quantities. 
The values of all universal constants and the units of measurement 
of all other quantities are then uniquely determined by the choice 
of units for L, M and 7. The nature of this relationship is given 
by so-called dimensional formulas. Several examples will make their 
meaning clear. The dimensions of velocity are LT~', acceleration— 
LT~-*, foree—MLT~*, the gravitational constant—M4ZL°T-, elec- 
tric charge in the formula for Coulomb’s law—M'2L3/7-1, etc. 
Knowing these formulas, we can immediately say how the numerical 
values of the universal constants and the units of derived physical 
quantities vary when the magnitude of some fundamental quantity 
is changed. 

As we shall see by examples in Sec. 81, dimensional analysis of 
physical quantities can be used to predict the nature of some depend- 
ence or other between physical quantities. ; 
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In addition to the system based on distance, time and mass, 
a system in which the fundamental quantities are distance (LZ), 
time (7) and force (F) is widely used. This is known as the FLT 
system. Naturally, the dimensional formulas in this system will not 
always be the same as above. Thus, moment of force in the FLT 
system has the dimensions FL, and in the MLT system the dimen- 
sions ML*T~*. Mass, being a derived quantity in the FLT system, 
has the dimensions FL-!7?. : 

The fundamental law of mechanics relates the quantities of force, 
mass, distance and time. Therefore, the value of the constant of 
proportionality in this formula depends in both systems on the 
choice of units. In both systems, a constant of proportionality 
equal to unity is assumed. This means that in the MLT system, 
using the formula F = ma, the unit of force is chosen so that F = 1 
when the mass and the acceleration are equal to unity. In the FLT 


system, using the formula m = f , the unit of mass is chosen so 


that m = 1 when the force and the acceleration are equal to unity. 

In this book, we shall for the most part use two variants of the 
MLT system: 

CGS system: L—centimetre, M—gram, T—second; 

MKS system: L—metre, M—kilogram, 7'—second. 

In the CGS system, the unit of force is the dyne = 1gm xX cm/sec? 
and the unit of work is the erg = dyne X cm. In the MKS system, 
the unit of force is the newton = 1 kg X m/sec” and the unit of 
work is the joule = newton X metre. 

If the reader is confronted with data expressed in the FLT sys- 
tem, these data should be converted into one of the indicated 
systems. To do this, he only need recall that a unit of force in the 
FLT system is a kilogram (the weight of a kilogram mass at sea 
level at 45° latitude), which is related to the two units of force 
adopted by us as follows: 


1 kg=9.81 newtons= 9.81 x 105 dynes. 


We shall return to the subject of systems of units when we con- 
sider electrical quantities. 


CHAPTER Il 
MECHANICAL ENERGY 


9. Work 


Motion without acceleration (i.e., rectilinear and uniform) may 
take place either without the action or with the action of forces on 
the body. In the latter case, the sum of the forces acting on the 
body is equal to zero. There is an essential difference between these 
two kinds of motion. In the first case, motion is not accompanied 
by work, while to achieve the second type of motion work must be 
expended. A motor works, moving an automobile uniformly and 
rectilinearly. A man works, 
moving a sleigh with its load f, 
uniformly and _ rectilinearly. 
We say in these cases that 
work is expended in overcom- 
ing resistance—friction, air 
resistance, etc. 

Of the two balanced forces 


acting on a body moving with- fa 

out acceleration, one is di- Deebtian oF bush 
rected along and the other op- paar 
posite to the direction of mo- 

tion. Fig. 15 


We say that a force acting 
along the direction of motion performs work. On the other hand, 
as regards a force directed opposite to the motion, we say that work 
is performed against this force. 

For quantitative evaluation, work is expressed as the product of 
the force acting on a body and the distance traversed by the body. 
The term work is the designation for this physical quantity. 

Let a body be acted upon by a number of forces whose vector 
sum is equal to zero. The body moves uniformly and rectilinearly. 
All the forces may then be resolved into four components (Fig. 15). 
Forces F, and F2 in accordance with the adopted definition, perform 
no work. Force F performs work equal to FAS, where AS is the 
traversed distance. The work of force F’ is equal to —FAS, where the 
minus sign indicates that work is performed against the force F’. 

Let us now consider the motion of a body with acceleration, i.e., 
curvilinear and nonuniform motion. As we already know, a resultant 
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force acts on the body, in this case, that is directed along the 
acceleration (but not along the path in the general case!). Let us 
again resolve all the effective forces into forces directed along the 
motion and perpendicular to the motion (Fig. 16). Now, F is not 
equal to F’, and F, is not equal to F.. Using the definition for 
work given above, we can assert, as before, that /, and F. perform 
no work. The work of force F’ is again negative, i.e., the work is 
performed against the force F’ and is equal to F’AS. Force F performs 
the work FAS, which is more 
than the work against the force 
of resistance. The surplus work 
serves to accelerate the body. 

The inequality between the 
forces F, and F2 shows that 
the motion is curvilinear. Their 
difference F.,—F, corresponds 
to the normal component of the 
acceleration. 

Let us consider an extreme 
case—uniform motion in a circle. 

Fig. 16 The resultant force forsuch motion 

is directed, as we know, along 

the radius of the circle, i.e., perpendicular to the direction of mo- 
tion. Therefore, the centripetal force performs no work. 

Thus, the surplus work in the general case of curvilinear accelerat- 
ed motion is not used to produce acceleration in general, but only 
the tangential component of the acceleration. For a particle, this 
can be expressed as follows: 


F—F’=ma; and FAS — F’AS = ma,AS. 


We repeat: (F — F’) is the tangential component of the resultant 


force F;". 


The work expended in accelerating a body (being equal, by defini- 
tion, to the projection of the resultant force on the direction of 
motion multiplied by the traversed distance) is equal to the product 
of the mass of the body, the traversed distance, and the tangential 
acceleration. The last equation above can be written in the form 
FAS = F’AS+ma,;AS and can be read as follows: The work 
performed by the effective force consists of the work against the 
force of resistance and the work expended in accelerating the body. 


f 


—__> 
Direction of motion 


Examples. 1. A jet passenger plane, having a weight P = 70 tons, attains 
a height h = 10 km. it it moves uniformly, the work performed in rising to 


this height is 
A, = Ph=7X108 kg-m = 68.6 108 joules = 68.6 1015 ergs. 
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If this height is attained over a path S = 85 km with a simultaneous increase 
in velocity (acceleration a = 0.3 m/sec?), the additional expenditure of work 
in producing acceleration will be 


Ag=maxS =17.9X108 joules= 17.9105 ergs=1.82x 108 kg-m. 
2. To plane a board 2 metres long and 20 centimetres wide, a joiner expends 
about 150 kg-m of work. 


10. Kinetic Energy 


Thus, in accelerating a body, the resultant force Fes performs 
the work 


A= Fr x AS =ma;,AS 


where’a; is the average tangential acceleration over the portion of 
path AS being considered. Substituting for a;, we obtain 
AvXAS 


ap mv X Av, 


-l=m 


where v is the average velocity which is equal to 2/. (v2-+v). 
Here vy, and v2 are the instantaneous velocities at the beginning 
and the end of the path respectively. Since Av = v2 — ry, then * 


9 4 9 
A mvs My fone A mus 
rs 3 2308 2 , 


i.e., the work is numerically equal to the increment in the value 
mv2 
2 


of . Therefore, the quantity 


- mv? 
K=—- 


is employed as a measure of the energy of motion of a particle. 
This quantity K will be called kinetic energy. The previous equation 
may now be read as follows: The work of the resultant force acting 
on a body (i.e., the product of the tangential component of the 
resultant force and the traversed distance) is equal to the increment 
in the kinetic energy of the body. This equation is convenient for 
the solution of elementary mechanical problems in which the path 
along which the force acts is given. 

We shall repeatedly be dealing with the term “energy”. It is one of 
the most important physical concepts. Energy, i.e., work capacity, 


* The same result is obtained if we write the expression for an infinitely 
small quantity of work in the form dA = mv dv and integrate it, from the mo- 
ment the velocity was v, to the moment-it reaches ve: 


va 
mv3 mv? Z 
A= mv dv = —y > ° 
vy 


4—1409 
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is a function of the state of a body. Work is produced at the expense of 
a decrease in the value of this function. Kinetic energy is a function 
of the state of motion. If the kinetic energy changes from K,; to Ko, 
then the work performed thereby is equal to K.—K,, independent of 
the nature of the motion. It is of no importance whether the velocity 
changed rapidly or slowly, uniformly or nonuniformly. The decrease 
in the kinetic energy by a specific amount always yields the 
same amount of work. 

Only in the case when the physical quantity is a function of slate 
can it have the sense of energy, i.e., a store of work. 


_ Examples. A unit of energy in atomic physics is the electron-volt (ev). This 
: on eeu energy of an electron accelerated through a potential difference 
oO volt: 


1 ev=1.6x 10-12 erg. 


The energy of a proton accelerated in a synchrotron is 10 Bey = 10!° ev =: 
= 0.016 erg. 

The kinetic energy of.a large jet passenger plane (m = 100 tons and v = 
= 800 km/hr) is 


2.5 1016 ergs = 2.6109 joules= 2.5 108 kg-m. 


11. Potential Energy 


Let us consider several phenomena in which the performed work 
is-not accompanied by a change in the velocity of the body. We 
shall be concerned with two types of problems. The first is related 
to the elastic deformation of a body, while the second deals with 
events occurring during the motion of a body in a gravitational or 
electric field. We shall presently show that in both cases we shall 
be dealing with the transformation of work into a special variety 
of energy, so-called potential energy. 

Elastic deformation phenomena will be treated first. Experiments 
show that for any elastic deformation—extension, compression, 
flexure, etc.—one can always find a function of state that increases 
precisely by the magnitude of the work performed on the body. 
This function of state or, in other words, the function of the body’s 
properties and degree of deformation is called the potential energy 
of elasticity. 

We shall show this energy to exist for a case of elastic deforma- 
tion, namely, linear extension or compression. Analogous examples 
could be given for any other kind of elastic deformation. 

Let some force, such as a muscular force, stretch a solid body, 
e.g., a spring, very slowly. The work expended in stretching the 
body from length J + s; to length 2 + s2, where J is the length of 
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the unstretched spring, is 
-l=F (s,—s}). 
The muscular force is balanced by the elastic force of the spring 


at every given moment. For a small extension of the spring, the 
latter is proportional to the deformation s*: 


Fea =ks. 
In the expression for work, we must use the average value of 
the force F, i.e., a (ks, -+ ks2). We then obtain**: 
4a bi ksi =a(48 is 


2 


a 


i.e., the work against the elastic force is expended in increasing 
F ks? é i : 
the quantity Ayr This quantity is, therefore, adopted as a measure 


of the elastic energy. The quantity 
Der peas ae 
will be called elastic potential energy. 

Elastic potential energy formulas for other kinds of deformation 
have exactly the same form. The body’s stiffness-with respect to 
a specific form of deformation is characterised by k, while s is 
a measure of the deformation (for example, twist angle, displacement 
angle, etc.). 

The quantity Ue; is energy in precisely the sense referred to at 
the end of Sec. 10. Irrespective of the manner in which a body is 
deformed and the rapidity of the process, the same amount of 


expended work will always correspond to one and the same incre- 
. », ks? ny ks? , 
mental value of the quantity =. Thus, = is a measure of energy 


or, to be more precise, of elastic potential energy. 


Examples. 1. The potential energy of a piece of steel wire (Young's modu- 
lus E = 21,000 kg/mm?) having a length of 50 metres and a cross-section of 
10 mm?, and which is stretched 1 cm, is 


ks? 
Vel = D 


= 20x 10% ergs = 2 joules = 0.2 kg-m. 


* It should be recalled that the law of elastic deformation (Hooke’s law) 
is written in the form - = E> , where £ is the modulus of elasticity and S 
is the cross-section of the stretched body. Thus, -the stiffness (the constant 


of proportionality in the expression for the elastic force) has the valuek = = : 


** Tho same result is obtained when we integrate the infinitely small amount 
of work dA = — ks ds between the limits s, and so. ; 
* 
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2. For rubber, Young's modulus E = 8 kg/cm*. A stone having a mass 
of 20 gm is shot from a slingshot to a height of 20 metres. This requires that 
an energy of 0.4 kg-m be imparted to the stone. Assume that the elastic band 
has an initial length of 40 cm and stretches an additional 40 cm. Let us deter- 
mine the required cross-section for the elastic band. 


ES s2 wu  -2%40 40 (k 
Ug —— a i >; = Est = ee 0.25 cm2. 
s(=%) 1,600 (cm?) 


Gravitational force possesses the same feature (discussed above) 
as elastic force. Thus, work expended in lifting a body in a gravi- 
tational field serves to change the body’s function of state. In this 
case, the function interesting us depends on the position of the 
given body with respect to the bodies attracting it. This function 
is called gravitational potential energy. 

We shall show this energy to exist, first, for a body located close 
to the Earth’s surface. From point 1, the body is moved to the 
higher point 2 along some curvilinear path. Let us divide this 
path into small segments, replacing the curved line by a broken line. 
The latter can be made to approximate the former to any desired 
accuracy. The work expended in moving a body along one of these 
linear segments of length dl is then 


dA=mgdlsina or dA=mg dh, 


where dh is the increase in height. Since mg does not cHange along 
the entire path of motion, we can place it before the brackets (before 
the integral sign when integrating) in writing the expression for 
the work expended along the entire path: 


A=mg (hz — 4), 


where h,; and hk,‘ are the heights of points 1 and 2 respectively. 
Furthermore, 


A= (ngh)s —(mgh), = A (mgh), 


i.e., the work of displacement is equal to the increase in the prod- 
uct mgh, which is a measure of the gravitational potential energy 
for this simple case. 
It is quite evident that 
U=mgh 

is energy and is in complete accord with the meaning we have 
assigned to this term. Irrespective of the manner in which the work 
is performed, i.e., the path taken by a body and the speed of the 
motion, the work of displacing the body from point 1 to point 
2 will always be the same, since the increase’ of energy depends 
only on the location of these points—in our simple case, on pies 
heights. 
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Since the work of displacing a body in a gravitational field does 
not depend on the path, the work of displacement along a closed 
curve will be equal to zero. 

It should be noted that it is immaterial what level we choose 
as our base for hk. If it is agreed to calculate h from the Earth’s 
surface, the potential energy of a body at the bottom of a well will 
be negative. 

The above formula is not valid for bodies that are far from the 
Earth, e.g., the Moon. Thus, as was explained in Sec. 2, mg, the 


approximate expression for the gravitational force, should be 


replaced for large distances by the exact expression mame 


Let us calculate the work done by the gravitational teres Work 
performed by the forces of a system will be considered positive, 
while work against the forces of the system will be considered 
negative. Let us assume that two attracting bodies draw together 
along the line of action of the forces over an infinitely small segment 
—dr of the path (minus, since 7 decreases). Thus, 


dA= — yt dr. 
But rsa (=) . Therefore, 
re r 
dA= —d (—y 72). 


Work takes place at the expense of a decrease in the value of 


U = —y™@™'"2 which is a measure of the gravitational energy in 
ae a 
the general case: 
dA= —adaU. 
The quantity 
T= ay mymg 


r 


represents gravitational potential energy in the general case. 

U is equal to zero if the bodies are infinitely far apart. When 
the bodies draw together, U increases in absolute value. But since U 
is negative, we see that, just as with the approximate formula for 
bodies close to the Earth, the potential energy is less the closer the 
attracting bodies are to each other. Naturally, if we desired, we 
could change the base line for U and make this quantity positive 
in the interval of values concerning us. . 

It is not difficult to show the relationship between the general 
formula for U and its particular case when U=mgh. Thus, replac- 
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ing r by R-+h, where R is the radius of the Earth, we obtain 


y= _ yum _ OR yA 
Rh <7 
{-+- R 
(M is the mass of the Earth). But since 4 is a small quantity, 
we can write with sufficient accuracy : = 1— + , whence 
1+ 
U= yay mgh. 


Changing the base line for U so thal zero potential energy for 
a body is at the Earth’s surface, the formula reduces to U = mgh. 


Example. To obtain a clearer picture of the meaning of the above results, 
let us calculate the potential energy of a body of mass m = 1 kg at the Earth’s 
surface and at a distance of 1,000 km above its surface. ' 

The potential energy at the surface of the Earth is 


Mm 4 5.8x 1027 x 103 
Es ~ $< — dS —-— [), J 5 1 . 
: YR 15x 106 6.3 x 108 6.1x10!4 ergs 


The potential energy at a height of 1,000 km is 


a 4 5.8 1027 x 103 
15x 108 7.3%108 


From the calculations, it is evident that 1!) the potential energy of a body 
‘in the Earth’s gravitational field is always negative and increases with its 
distance from the Earth. (since we have agreed that it tends to zero when 
h —» 0); 2) the change in the potential energy of a body rising above the 
Earth’s surface is, generally speaking, not described by the expression 
mg (ho — hy). Thus, 


Us,000— Up = — 5.3% 1014 — (— 6.11014) = 0.81014 ergs = 816,000 kg-m, 


while the calculation with the expression mg (hg — hy) yields 1,000,000 kg-m. 
However, when we are concerned with ascensions to a height kh < R (R is the 
radius of the Earth), it is permissible to use the simplified expression 
mg (hg — hy). 


Us,000= = —5.3x10M ergs. 


The formula for the potential energy of the electrical interaction 
of charges is very similar to that for gravitational potential energy. 
Let us consider two electrical charges q; and g, having the same 
sign and separated by a distance r. According to Coulomb's law, 
the particles will repel each other. Therefore, in reducing their 
separation to the small distance dr, we perform work equal to 


—dA = — UP ar (the minus sign is used in the left-hand member 
because the work is performed against the forces of the system; 
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the right-hand member also has a minus sign because the distance 
is decreasing and dr is negative). The calculation, which in no 
way differs from that for gravitational force, yields for the energy 
of electrical interaction of charges (for brevity called Coulomb 


energy) the expression U = a, i.e., here too dA = — dU. 


The interaction energy of charges having opposite signs is nega- 
tive and behaves like gravitational energy. The interaction energy 
of charges having the same sign is equal to zero when the charges 
are separated by an infinite distance; it increases as the charges 
are brought together. 

We shall restrict ourselves to these examples of potential energy, 
although in various cases other functions of state of a body may 
be introduced. 

Potential energy always exists when forces act between bodies 
or particles of the system under consideration that depend on the 
distance between the bodies. Potential energy is the interaction 
energy of the bodies. If a system consists of a number of bodies 
or particles, we can then speak of its total potential energy, i.e., 
the interaction energy between all the particles (each with all the 
rest). Thus, in the case of four particles, the potential energy is 
composed of six terms, for we must consider the interaction be- 
tween the first body and the second, third and fourth; the second 
body and the third and fourth; and finally the third body and 
the fourth. 

In mechanics, only the potential energy of forces acting between 
different bodies is considered. If a body is complex and consists 
of many particles, the interaction potential energy of these parti- 
cles is considered to remain unchanged during the mechanical proc- 
esses. The interaction potential energy of the particles comprising 
the body is a component part of the body’s internal energy (Chap- 
ter IX). If changes in a body’s internal energy take place, the 
Phenomena must be considered in the light of the laws of thermo- 
dynamics (Chapter IX). 


12. Law of Conservation of Mechanical Energy 


Irrespective of the type of forces involved in the motion, the 
work of the resultant force is always equal to the increment of 
the body’s kinetic energy, i.e., 


FAs=A ( me ye 


The forces acting on the body could be elastic forces, gravitational, 
electrical and frictional forces, etc. 
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It is always possible to separate from the effective forces those 
whose work serves to change the potential energy. For brevily, 
such forces are sometimes called potential forces or forces possessing 
potential. The work equation may be written in the form 

FootAs-+fAs=A ( a } ; 


2 


Here, f represents the nonpotential forces. The work of these forces 
is equal to the change in the internal energy of a body or the medium 
in which the body moves. 

Substituting in place of the work of the potential forces the 
increment of potential energy with reversed sign, we can write 
the equation in the form 


fds=A(4>+ u). 


The sum of a body’s potential and kinetic energy is called total 
mechanical energy. Designating this quantity by 6, we obtain: 
fAs = A@, i.e., the change in a body’s total energy is equal to 
the work of the nonpotential forces, e.g., the frictional forces. 

If the work performed in changing the body’s internal energy 

is small with respect to 6, the equation simply reduces to the 
following: Aé = 0 and 6 = const. This is the law of conservation 
of mechanical energy, which states that the total mechanical energy 
of a body is conserved. 
_ This law may be easily generalised for a system consisting of 
many bodies or particles. For each body we may write a work 
equation and then combine these equations into one. The total 
energy will then be equal to the sum of the kinetic energies of the 
bodies and the potential energy of interaction: 


v v3 
= A AU. 


If all the interacting bodies are taken into account (such a system 
of bodies is called a closed system), the form of the law remains 
the same as for a single body. The change in mechanical energy 
is equal to the work of the nonpotential forces and, if this work 
is negligible, the total mechanical energy of a closed system of 
bodies remains unchanged, i.e., is conserved. 

The law of conservation of mechanical energy is, on the one 
hand, a consequence of the equations of mechanics (Newton’s law); 
on the other hand, it may be considered as a special case of a more 
general law of nature—the law of conservation of energy (Chap- 
ter IX). 

Even in mechanics alone, many forms of interconvertible energy 
are met. In considering the motion of a body under the action 
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of elastic forces or gravitational force, it is easily seen that an 
increase in the energy of one of the mechanical forms is accom- 
panied by a decrease in the energy of the other form. 

Thus, the gravitational force acting on a falling body decreases 
the potential energy of the body and increases its kinetic energy. 
The reverse is true when a body is lifted to a certain height. The 
elastic force making a ball thrown against a wall rebound decreases 
the potential energy of the compressed ball and transforms it into 
kinetic energy. The reverse takes place when the wall stops the 
thrown ball (the interval from no deformation to maximum com- 
pression). ; . 

A stretched spring can raise a load to a certain height. On the 
other hand, a falling load can stretch a spring. Thus, elastic energy 
can be transformed into gravitational energy and vice versa. 

The above examples apply to the transformation of one form of 
energy into another in one and the same body as well as to the 
transfer of energy from one body to another. — 

It is possible, of course, to transfer energy 1n the same form from 
one body to another: one load pulls another by means of a pulley, 
a sphere colliding with another transfers part of its kinetic ener- 
gy to it, etc. 


13. Potential Curves. Equilibrium 


The potential energy of interaction of bodies or particles depends 
on their relative distribution, i.e., it is always a function of the 
coordinates or other parameters describing the location of these 
bodies in space. In the simplest cases, the potential energy. may 
depend on a single coordinate. 

Let us consider the interaction of two particles whose potential 
energy of interaction is described by the function U (x), where z is 
the distance between the particles. For the sake of definiteness, 
let us assume the particles repel each other with a force F. Under 
the action of the interaction forces, the distance between them 
increases by dz, i.e., an amount of work equal to Fdz is performed. 
This is possible at the expense of the potential energy of interaction 
U, which changes by —dU (decrease of energy). 

Thus, —dU=Fdz 
or 
dU 
“dz. ’ 


ee 


i.e., in the case of potential forces the force is equal to minus the 
derivative of the potential energy with respect to z. The nature 
of the mechanical problem is then very simple and is clearly described 


F=— 
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by so-called potential curves, i.e., graphs on which the values 
of the potential energy are plotted as a funclion of the parame- 
ter (Fig. 17). 

In explaining the essence of this graphical method, the analogy 
is usually drawn with the motion of a body on a hill. The meaning 
of the potential curve then becomes particularly clear, for the 
profile of a hill and a potential energy distribution curve that is 
proportional to the height h of the hill coincide if drawn to prop- 
er scale. 

Potential curves consist of crests and troughs, steep and gradual 
rising slopes as well as steep and gradual declining slopes. The 
form of the curve permits us to immediately indicate on which 
portians of the path a large amount of work is performed, on which 
a small amount, and whether the work is positive or negative in 
each case. The steeper the potential curve, the larger the force 
acting on the body. In accordance with the familiar geometric sense 
of the derivative, force is described by the tangent of the angle of 
inclination of the tangential line to thé potential curve. 

The validity of the formula relating potential energy and force 
is completely evident for those particular cases of potential energy 
that we have considered. For the potential energy of a body on the 
farth’s surface: : 
dU 


U=mgh and F-= eae 


= —mg. 


For a body in a gravitational field, in the general case: 


a, MyMg _ _ dU __, my 
UO = —y—— and F= a Ni ee 
For a body subjected to elastic action: 
x2 
(a and F = eee —kex. 
2 dx 
For electrical interaction: 
Eee LR ye ae ees 
r dr r 


Returning to the potential curve plotted in the diagram and 
keeping the above explanation in mind, we can immediately indi- 
cate where the force is greatest and the points where the force acting 
on the body is equal to zero. The latter points, i.e., the positions 
of equilibrium, are at the bottom of the potential well and at the 
potential peak. The positions where the potential energy is a maxi- 
mum correspond to unstable equilibrium, while the bottom of 
the potential well is a position of stable equilibrium. 


13. Potential Curves. Equilibrium 59 


We stated above that the form of the potential curve permits 
us to describe the possible motion of the body. This is not com- 
pletely accurate. In addition to the potential curve, we must also 
know the value of the total mechanical energy of the body. If this 
value is known, we can then indeed deduce from the form of the 
potential curve the possible motion of the body or particle. 

Horizontal lines are drawn in Fig. 17 at the ordinates corre- 
sponding to @; and @s. If @ is the total energy of the particle, 
then we can determine the 
kinetic energy as well as the U(z) 
potential energy from the 
curve. The former is the 
difference between € and U. 

The moving particle cannot 
occupy positions in which the 
polential energy is greater 
than the total energy. Thus, 
the horizontal line @ restricts 
the possible motion of the 
body to certain portions of 
the curve. In the case when 
the energy is represented by 
the lower line @,, the moving 
point has two possible inter- 
vals in which it may be locat- Fig. 17 
ed. It may be either in the 
potential well (and have an oscillatory motion there) or on the 
slope to the right of point A, where it will move downwards or 
upwards depending on whether it acquires or loses kinetic energy. 

The above analysis is completely valid for any kind of potential 
curve. Fig. 18 shows several types of such curves. Thus, in Fig. 18a, 
we see the potential curve for a body oscillating on a spring. The 


Forbidden region for z 
the total energy &2 


Ur) 


V(x) Uz) 


@) b) ¢) 
Fig. 18 
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oscillating body is in a potential well having symmetrical edges. 
In Fig. 18b, a potential curve is shown that is typical for many 
interacting particles, e.g., atoms and molecules. The curve consti- 
tutes a potential well, one of whose edges has a very steep slope 
-while the other has a gradual one. Plotted along the abscissa is 
the distance between the particles. As can be seen from the curve, 
the potential energy is very large at small distances, falls with 
increasing distance, reaches a minimum, then gradually rises tend- 
ing toward some finite limit. The nature of the motion and the 
bond between two interacting particles are completely described by 
this curve. Two cases should be distinguished. The first is when the 
total mechanical energy of this pair of particles is represented by the 
lower horizontal line €,. The second is when the total energy is 
equal to @,. In the first case, the system cannot get out of the 
potential well. This means that the distance between the particles 
lies between the limits indicated in the figure. The mutual motion of 
the particles can only be of an oscillatory character. Such is the 
situation in a stable diatomic molecule. The second case is the reverse 
of the first. The total energy of interaction of the particles is too 
large for them to be constantly linked. The system may get out of 
the potential well, i.e., the bond between the particles cannot exist 
and the particles may fly apart to any distance whatsoever. 

The third potential curve in the figure is a so-called square well. 
Recalling that force is described by the tangent of the angle of 
inclination of the tangential line to the potential curve, we see 
that the potential energy may be represented in the form of a square 
well if the body or particle moves freely without the action of 
forces, yet cannot leave the bounds of the given portion of the 
curve as long as the total energy is less than the height of the sides 
of the well. 


CHAPTER III 
MOMENTUM 


14. Conservation of Momentum 


The product of the mass of a body, or particle, and its velocity 
is known as the body’s momentum (quantity of motion): p= mv. 
The momentum p is thus a vector quantity. In a system of bodies 
or particles, the momentum is equal to the vector sum of the par- 
ticles constituting the system: 


P=Pt+Pet... 

What makes this vector quantity of particular interest to the 
physicist is the fact that in a closed system the vector P does not 
change, irrespective of the motion within the system itself. This 
proposition is known as the law of conservation of momentum. 

The. law of conservation of momentum follows directly from 
Newton’s laws. For each of the bodies in the closed system, the 
following equation is valid: 


: a (mv) =f, 


1. ¢., r 
Pp Al 
au. I . 
Let us consider what happens when we write such an equation 
for each of the bodies and then add the equations. The right-hand 
member of each equation represents the forces exerted on the given 
body by all the other bodies. Thus, the force exerted on the first 
body is equal to the sum of the forces exerted on it by the-second, 
third, etc. Using double indexes, we may write: Py.+ Ii3+Fi,+... 
Similarly, for the forces exerted on the second body, we may write: 
Foj+Foo+Fo3+..-; for the third: F'g:+ M32.+F33+...; etc. 
It is not difficult to see that when the right-hand members of the 
equations are added the result is zero. For each term in the first line, 
there is always a term in another line that is equal and opposite 
to it (in accordance with the law of action and reaction). Thus, 
when #;, and F; are added the result is zero; also Fy3 and I3;; 
etc. Therefore, in a closed system, the following equation holds; 


dp, , dp, , dp Ns d _ 
grt get Get... =0i. 7 (Pit Pot Dst ---) 0 


or 
Dit prt pst... =const. 
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This is the law of conservation of momentum. The magnitude and 
direction of individual moments may change, but their vector sum 
in a closed system does not. 


Magnitudes of some momenta: The momentum of an electron with an 
on FM cm ; F 2 
creas? | that of a rifle bullet is ~ 8:4 105 


kg x metres 
sec : 


energy of 5 ev is ~ 12x10- 


gmxcm _ » kgxmetres 


sec sec ; and-that of a freight train is ~ 107 


15. Centre of Mass 


The methods of finding the centre of gravity of a body are well 
known. If a body is fixed at its centre of gravity, it is in a stale of 
neutral equilibrium. For a sys- 
tem of particles, ora solid bcdy 
considered lo be broken up into 
elementary elements having 
the size of particles, we can 
write an analytical expression 
for the position of the centre 
of gravity. 

Using the rule for adding 
parallel forces (Fig. 19), we 
obtain the foffewing expres- 
sion for the position of the 

Fig. 19 centre of gravity when the 
particles are considered to be 
distributed along a straight line, say the z-axis: 


X= my2y+ More+ m3gx3-+... 


my+mg-+m3-+... 


Here, x4, X2, r3 ... are the coordinates of the particles, and my, 72, 
m3 ... are the masses. Masses are used instead of weights since the 
acceleration of the gravitational force cancels out. 

It is shown in theoretical mechanics that for any distribution of 
particles the expression for the position of the centre of gravity 
has the form: 


| 
X ————>| (+12 *3) J 


Ra + morg+ mgr3-+ ... 
my-+-mg+m3+... ? 
where B is the radius vector of the centre and 74, 7°, 7°3 are the radius 
vectors of the particles. 
Since the acceleration of the gravitational force cancelled out in 
these formulas, we can conclude that the point found has an objective 
significance that does not depend on the gravitational conditions. 
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It is valid, in fact, even if the? body is located in interplanetary 
space under conditions of weightlessness. It makes sense, therefore. 
to replace the prevalent designation “centre of gravity” by a desig- 
nation that more accurately expresses the essence of the matter. 
Thus, we speak of the centre of inertia orcentre of mass of a body 
instead of its centre of gravity. 

We shall directly see the full significance of this designation. Let 
us consider the velocily of motion of the centre of mass: 


dR 


7 ae, 
b dt 


Using the formula for the determination of the centre of mass, 
we obtain: 


__ My U4 HP Mgly + MZUZ-+ . 
my my+m3g+... 


In the numerator we have the total momentum, which is conserved 
in a closed system. Thus, the right-hand member of the equation 
is equal to a constant quantily. We can conclude, therefore, that 
the velocity of the centre of mass does not change in magnitude 
or direction. Or, in other words, the centre of mass of a closed system 
of particles executes inertial motion. 

As we already know, all inertial systems of coordinates have equal 
validity. Hence, we can always go over to a coordinate system bound 
to the centre of mass of the system under investigation and consider 
this interesting point as fixed. In atomic physics, we often consider 
collisions between particles. To study this phenomenon, two systems 
of coordinates are used—the laboratory system (the natural coordi- 
nate system of an observer) and the system bound to the centre 
of mass of the colliding particles. The advantage of the latter frame 
of reference is evident: the total momentum of the particles is 
equal to zero. 


16. Collisions 


The word “collision” should be understood in a somewhat broader 
sense than that used in everyday practice. For the mechanical 
problems that now concern us, any encounter between two or more 
bodies in which the interaction is of short duration will be consid- 
ered to be a collision. Thus, in addition to the phenomena that 
can be classified as collisions in the usual sense of the word—e.g., 
impact of billiard balls and collisions between atoms and atomic 
nuclei—we have such events as a man jumping on or off a street 
car and a bullet hitting a wall. The forces arising as the result of 
such short interactions are so great that the role of all constant 
forces being exerted is negligible. As a result we are justified in 
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considering the colliding bodies as a closed system and we can 
apply the law of conservation of momentum to them. 

In many collisions, the duration of the interaction is measured 
in thousandths of a second. During this 
interval of time, the force rises to its 
maximum value and then drops to zero. 
A typical curve for the force during such 
an impact is shown in Fig. 20. For each 
instant of time during the impact, the 
relationship between the force exerted on 
either of the bodies and the momentum 
of this body is given by Newton’s sec- 
ond law: 


F 


d 
t = (mv) = F. 
Fie. 20 Rewriting this equation in the form 
g- FAt = A(mv), we can say that the prod- 
uct of the average value of the force 
and the duration of its action is equal to the change in momen- 
tum. A more accurate description of the phenomenon is obtained 
if we integrate the above equation from the initial instant 
of impact to the termination of the interaction. It is evident 
that - 


Tt 
Sy F dt =(mv),—(mv),. 
0 


The integral on the left is sometimes called the impulse of the 
force. In the diagram, this quantity is represented geometrically 
by the area under the impact curve (see Fig. 20). 

There is considerable variation in the nature of collisions, depend- 
ing on the elastic properties of the bodies. It is customary to consid- 
er two extreme cases—ideally elastic and absolutely nonelastic 
impacts. 

First, let us consider the latter type. A nonelastic impact means 
an encounter between two bodies whereby. these two bodies become 
joined. Examples of nonelastic impacts are collisions between clay 
spheres, a man jumping onto a moving trolley, the collision between 
oppositely charged ions resulting in the formation of a molecule, 
and the capture of an electron by a positive ion. 

Assume that the bodies moved with velocities v; and v2 before 
the encounter. Thus, the total momentum was m,v,-+ move. After 
the encounter the bodies have a common mass equal to m,+ mz 
and move with some velocity V. The momentum of the system 
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after the encounter is (m,—m.)F. Since the law of conservation 
of momentum requires that 
(m+ me) V = myo, + mgVo, 


the velocity of the bodies after the nonelastic impact is given by 
the formula: 
i a = my Uv; mete 
my Mg 
The momentum after the encounter should equal the sum of the 
momenta before impact. 

If the motion of the bodies colliding head-on is along a straight 
line, then after impact the bodies will follow the direction of the 
body having the originally larger momentum. If the momenta of 
the bodies are equal in magnitude, MV, = —Mgv2; V is thus equal 
to zero, i.e., the colliding bodies come to a standstill. 

A nonelastic impact is accompanied by a transformation of 
energy. From the example just given, it is seen that the kinetic 
energy may even become zero. It is not difficult to calculate the 
increase in the internal energy of colliding bodies in one or another 
case. All that we need do is perform the following subtraction: 

my + me yee mv? vs 
2 (74 2 )- 

Let us now consider ideally elastic collisions, i.e., collisions in 
which the form of the bodies is completely restored. This means 
that no changes occur in the state of these bodies, their potential 
and internal energy before and after impact remain unchanged and, 
consequently, the kinetic energy is conserved. For two bodies col- 
liding in this manner, two equations can be written that are based 
on the law of conservation of momentum and the law of conservation 
of kinetic energy. Let us designate the masses of the bodies by m 
and M. We can always make the origin of the coordinate system 
coincide with the position of one of the bodies. This simplifies the 
problem without in any way making it less general. Let us assume, 
therefore, that the body having mass M is at rest before impact. The 
above laws of conservation then yield the following two equations: 


mu=mv+MV and + neu? 5 mv? + 5 MvV?. 


Here, w and v are the velocities of sphere m before and after impact, 
and V is the velocity of sphere MM after impact. 

Let us consider several examples using these equations. First, 
we shall examine the case of noncentral* collision of two spheres 


* The impact is classified as central if the motion of the spheres before 
ampaet occurred along a straight line passing through the centres of the 
spheres. 
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having equal mass (Fig. 21). The masses cancel out in both equations 
and we obtain 


w=v+V and ur—v?-+V?. 


From the vector equation, it is clear that the vector 2 closes the 
triangle formed by vectors v and V. The equation on the right shows 
that the triangle, for which w is the 
hypotenuse, must be a right triangle. 
Hence, it follows that the velocities, 
after the collision of two particles hav- 
ing equal mass, must be directed at 
right angles to each other. This inter- 
esting conclusion is easily verified in 
billiards, where the directions of motion 
of the object ball and the cue ball form 
an angle of 90°. In other respects, the 
nature of the velocity change is not de- 
termined by our equations, for they do 
not take into consideration the deviation 
of the line of impact from the line passing through the centres 
of the spheres. 

A complete description of the motion of the spheres after impact 
is obtained if we restrict ourselves to the case of central impact. 
The motion of the colliding spheres will then be along the same 
straight line after impact as before impact. We can dispense, there- 
fore, with the vector notation, keeping in mind, however, that 
a change in the velocity’s sign means that the direction of motion 
has changed. In this case, there is no need for making the simpli- 
fying assumption of equal masses. The equations for central colli- 
sion have the form. 


mu=mv-+ MV and mu? = mv? + MV?, 


Rearranging terms, these equations can be written in the form 
m(u—v)=MV and m (u®—v?) = MV?. 


Dividing the latter by the former, we obtain: vu -+ v = V oru = 
=—(v—V). Note that the relative velocity of motion of sphere 
m with respect to sphere M before impact (designated by wu) is 
equal in magnitude to the same relative velocity after impact. 

An interesting formula is obtained when we substitute V = u + v 
in the formula for the law of conservation of momentum. We obtain 
an expression for the velocity of sphere m after impact in terms 
of the velocity of this sphere before impact: 

m—M 


oe Ne aa 
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If the masses of the spheres are equal, the velocity v reduces to 
zero. This phenomenon can be demonstrated very effectively with 
steel or ivory spheres. For such an impact, the spheres, so to speak, 
exchange velocities (Fig. 22). In other cases, sphere m is retarded. 
The closer the values of the masses of the colliding spheres, the 
more effective the retardation. 
It is not difficult to calculate Z 
that when a neutron (mass 1) 
rebounds from a carbon atom 
(mass 12) it loses ?/;3 of its ve- 
locity and when it rebounds from 
a uranium atom (mass 235) it 
loses only 2/935 of its velocity. 
For macroscopic bodies, the 
laws of elastic impact are quite Fig. 22 
valid for such materials as ivory, 
steel and rubber. These materials, after having been deformed, are 
able to reassume, to a high degree, their original form. This is 
illustrated by the interesting photograph shown in Fig. 23, where 
by means of slow-motion photography 
the moment of impact of a hockey 
ball on an obstacle is filmed. In 1/5 o99 
of a second, the ball is compressed 
almost one centimetre, and it takes 
the same amount of time for the re- 
storing phase of the impact. In the 
first phase, the kinetic energy of the 
impact is transformed into potential 
energy of elastic compression. In the 
second phase, the potential energy is 
transformed back into kinetic energy. 
For an ideal impact, this reverse proc- 
ess should completely restore the 
value of kinetic energy expended dur- 
ing the first phase of the impact. 
Our formulas aré not applicable for 
the important case of elastic impact 
of a sphere on a wall (Fig. 24). Since 
the kinetic energy must be conserved, 
the velocity of the sphere cannot change 
Fig. 23 in magnitude. As regards the direc- 
tion of the sphere’s motion afterimpact, 
it should form the same angle with the normal (90°—a) as before 
impact. Thus, in the case of impact on a smooth wall, the tangential 
component of the velocity remains unchanged, since no tangential 
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adhesion forces are exerted by the wall. As can be scen from the 
figure, the increment of the momentum is numerically equal to 
2mv sin a and is directed along the normal to the wall. According to 
the. fundamental law of mechanics, at the 
instant of impact, the force exerted on the 
sphere by the wall has the same direction as 
that of the vector of momentum change. 
The angle of incidence of the sphere is, there- 
fore, equal to its angle of reflection. 


Let usconsider an inelastic impact, using as our 
example a ballistic pendulum (a device for measuring 
‘the velocity of a bullet). A bob containing sand, mass 
M, hangs from a line. The bullet is fired into the bob 
and becomes imbedded in the sand. The momentum 
of the bullet before impact is mu, and the momen- 
tum of the system after impact is (Af -+- m) v. Hence, 


an m 
mm 

Fig. 24 Mov2 

Acquiring the kinetic energy ae 


it in rising to the height h, determined by the following condition: 


, the bob expends 


Mgh= 


Mv? _. u2 m \2 
Zi i.e. coe (+7) (nm < M). 
If M = 10 kg, m= 10 gm and u = 900 metres/sec, then kh = 4 em. . 

If we had not used the law of conservation of momentum in determining h, 
but had assumed instead that the total kinetic energy of the bullet had been 
transformed into potential energy of the pendulum, we would have obtained 
the value k = 40 metres (!). This means that, in our example, 399.6 kg-m of 
mechanical energy, or 99.9% of the total supply, has ‘‘disappeared’’, i.e., gone 
to heat the system. Since absolutely clastic bodies do not exist, mechanical 
energy is not conserved for ‘‘elastic’’ impacts as well, for part of it is transformed 
into energy of thermal molecular motion and is dissipated. We shall return 
to_this example in Chapter XI (p. 181). 


Using an example involving collision we shall now illustrate 
the merit of a coordinate system bound to the centre of mass. 

Assume that a sphere of mass m, at rest in a laboratory coordinate 
system, is hit by a similar sphere with velocity v. If the impact is 
inelastic, some portion of ae the kinetic energy of the system, is 
transformed into heat. In other coordinate systems, the kinetic 
energy of this pair of spheres is expressed by other quantities. As 
regards the heat released, it will be the same for the given pair of 
spheres and is simply determined by the velocity of their relative 
motion. Therefore, instead of resorting to the law of conservation 
of momentum to try to determine the portion of the kinetic energy 
that is transformed into heat, calculated for the laboratory coordinate 
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system, it is sufficient to calculate the kinetic energy for a coordi- 
nate system bound to the centre of mass. Since in such a coordinate 
system the total momentum of bodies is equal to zero, after an 
inelastic collision the spheres come to a standstill: all the kinetic 
energy is transformed into heat. The kinetic energy will have 
a minimal value for a system bound to the centre of mass. 

In a coordinate system bound to the centre of mass, the spheres 


P os { - : 
move toward each other with velocities =v. The kinetic energy of 
; . 1 ° 

each sphere is thus equal to Zin and the total energy of the system 
is a mv?, This is the amount of heat released during an inelastic 
collision. Irrespective of the type of impact, the heat (or other form 
of energy) released due to the kinetic energy of the bodies cannot 
exceed the amount of kinetic energy calculated for a system bound 
to the centre of mass. And conversely, in order to release a given 
amount of heat, it is necessary to have the equivalent amount of 
kinetic energy calculated for a centre of mass system. 

Example. A nuclear reaction in which a@-particles bombard nitrogen: N!4 
takes place in accordance with the following equation: 

NUL Het > OH, 

The energy absorbed in this process amounts to 1.13 Mev. How much kinetic 
energy in a laboratory system must an a-particle possess in order for the reac- 
tion to proceed? At first glance, it seems that 1.13 Mev is sufficient for this pur- 
pose. But we already know that this is not the case. In a centre of mass coor- 
dinate system, 1.13 Mev is required, but in a laboratory coordinate system, 
more energy is needed. 
mV; + mMmgvo 


my-+ me 
is the momentum of the first particle and mgve is the momentum of the 
second. The velocity of the first particle in a centre of mass coordinate 


Thus, the velocity of the centre of mass is vp= , Where m,2%, 


system is vj = v,— vp = ——2— (vy— v2). For the second particle, we may 
my+me 
: ; m . : 
write: Oia te, ve Hence, the kinetic energy of the sys- 


tem (a, N14) in a centre of mass coordinate system is Kem= 5 (v4 — v9), 


myms 

my-+imy ; 
consider the nuclei of N14 fixed (v2'=0). This assumption is justified, since 
we can always neglect the slow thermal motion of the target nuclei as 
compared with the large velocity of the bombarding particles. The kinetic 


where = is the so-called reduced mass of both particles. We shall 


energy in the laboratory coordinate system is then Kiav=5 m,v? and there- 


fore 
my mo 


Kiap = K 
lab cm Me 
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The reaction proceeds if Ken =1.13 Mev. Since my=4 and mg =14, we 


obtain 
Kigy=1. 13x18 = 1.45 Mev. 
14 


17. Recoil 


The law of conservation of momentum helps one to easily under- 
stand the fundamentals of recoil in gunfire, reaction propulsion, 
and other similar phenomena. 

We shall consider, in the first place, recoil taking place in a frame 
of reference where the bodies are at rest at the initial moment. 
In the case of gunfire, this assumption is in complete accord with 
the prevailing conditions. If at the initial moment a system con- 
sisting of two or more bodies is at rest, the tolal momentum of 
the system is equal to zero. Irrespective of the future course of 
events, the total momentum continues to be equal to zero. Thus, 
if at some instant an explosion takes place, causing the system to 
be divided into parts having masses m,, mg, m3, ..., which fly 
asunder with velocities v4, v2, v3, ..., the total momentum mv; +4- 
+Mgv2+m3v3-+ ... of the scattered bodies must be, as before, 
equal to zero. ; 

In the case of gunfire (where the system divides into two parts), 
the condition that the momentum of this system of two bodies be 
equal to zero has the form mv-+ MV = 0. Here, the lower-case let- 
ters refer to one body, say the missile, -and the capital letters to 
the other—the gun. The division of the system into two parts can 
only take place along a straight line. We can, therefore, dispense 
with the vector notation and write the condition in the form mv = 
= —MV. The velocities of the gun and the missile are inversely 
proportional to their masses. Thus, the greater the mass of the 
missile with respect to the mass of the gun, the greater the observed 
recoil. 

The phenomenon of “continuous recoil’’, occurring in reaction pro- 
pulsion, is of exceptional interest. It is the subject of a distinctive 
branch of mechanics that may be called the mechanics of variable 
mass. This phenomenon does not only occur in jet planes. Indeed, 
we can point to a number of commonplace occurrences involving 
such motion. As examples, it is sufficient to mention the case of 
an uncoiling roll of paper or the fall of droplets continuously con- 
densing in the atmosphere (see the example at the end of this sec- 
tion). The fundamentals of the mechanics of variable mass were 
developed at the end of the nineteenth century by Prof. I. V. Me- 
shchersky. Since we cannot describe his work here, we shall restrict 
ourselves to the consideration of a single problem in this field—a 
problem related to the possible velocity of motion of a rocket. 
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A rocket moves with a velocity v and at some instant ejects 
a certain amount of combustible gas having mass dA7. The mass of 
the rocket, naturally, decreases by this amount. If the velocity of 
the ejected gas is designated by u (this velocity is not given with 
respect to the rocket, but with respect to the inertial coordinate 
system in which the velocity of the rocket motion is described), 
the momentum of the matter escaping from the rocket will be equal 
to udi7. The rocket decreases its mass and increases its velocity by 
the amount dv. The momentum of the rocket after ejecting the fuel 
is equal to (AJ —dM) (v+ dv). In accordance with the law of con- 
servation of momentum, we can equate the momentum JJv of the 
rocket before discarding a portion of the fuel and the momentum 
of the system after that quantity of gas has been ejected. The latter 
is equal to the difference between the momentum of the rocket 
and the momentum of the portion of fuel. Thus, 


Mv = (M— dM) (v+dv)—udM. 


Whence, excluding second-order infinitesimals, 
dM 
dv= (u + v) TT: 

But u-+v is the relative velocity of the outflowing combustible 
gas (with respect to the rocket). Designating this velocity by c, we 
arrive at the following equation for the increment in-the rocket’s 
dM 
M 
velocity increases when the mass decreases. It can be seen that the 
increase in velocity is equal to the fraction of the lost mass mul- 
tiplied by the relative velocity of the ejected fuel. 

Taking the velocity of the outflowing gas with respect to the 
rocket to be a constant value, the above equation can be easily 
integrated. If the mass of the rocket was M ) when the velocity of 
the rocket was vo, and became equal to AZ when the velocity of 
the rocket changed to v, integration yields 

ro M 


velocily: dv= —c . The minus sign is used to show that the 


aM 
\ d= —c \ ir? 
vO Mo 
1. @., 
= Mo 
v—Uyp=cln i: 


The latter formula was initially obtained by K. E. Tsiolkovsky, 
the first to design a rocket and do research in the theory of inter- 
planetary travel. 

Going over to common logarithms and introducing the designa- 
tion m= M,)—M for the difference in the mass of the rocket, i.e., 
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for the mass of the ejected fuel, we obtain Tsiolkovsky’s formula 
in the form 


v=cx2.3x log (1 cr) 


(the initial velocity v9 is assumed to be equal to zero). 

In modern rockets, the velocity of gas outflow is probably not 
less than 2,000 m/sec. Using this velocity in the formula, the 
following table of values is obtained: 

om 0.2 1.0 4.0 10.0 32.3 54 999 
v (m/sec) 446 1,386 3,218 4,817 7,013 8,000 13,815 

As can be seen from this table, the rocket velocity increases 
much slower with respect to the amount of ejected fuel than one 
would like. To give the rocket a large velocity, a tremendous amount 
of fuel, relative to the initial mass of the rocket, must be ejected 


Thus, if a velocity of 7 km/sec is imparted, less than a of the ini- 


tial rocket mass will remain. 

A velocity of about 11 km/sec must be imparted to a rocket if 
it is to escape from the Earth’s gravitational pull. This figure is 
obtained in the following simple manner. To escape from the Earth, 
a rocket must possess sufficient kinetic energy to perform the work 
of moving a body from the Earth’s surface to infinity. But this 
work against the force of gravity is equal to the difference between 
the rocket’s potential energy at the Earth’s surface and at infinity. 
Since at infinity the potential energy is equal to zero, the condition 
for escape from the Earth has the following simple form: 


mv? mM 
=, “7? 


2 R 


where M and AR are the Earth’s mass and radius, respectively. 
Multiplying the numerator and the denominator of the right-hand 
: P 1 

member of the equation by R, then substituting vor by g, the 
acceleration of the gravitational force at the Earth’s surface, and 
cancelling the rocket’s mass, we obtain the condition for escape 
from the Earth: v = VY 2gR, which yields a figuré of about 11 km/sec. 

If we assume that the velocity of the gas outflow is 2,000 m/sec, 
the ratio can be obtained from Tsiolkovsky’s formula. It is 
equal to 244. For the rocket to escape from the Earth, its design 


must be such that only ats of the rocket’s mass before take-off 


will remain in its interplanetary flight. The problem is an excep- 
tionally difficult one. Basic advances in the field of astronautics 
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will be achieved, therefore, when methods of producing rocket 
engines with considerably larger velocities of fuel ejection are 
found. If we were to succeed in increasing this velocity by a factor 


. . . : . a m 
of three, i.e., increasing this velocity to 6 km/sec, the ratio—— would 


M 
fall to 5.3. 

It is easier to put an Earth satellite into orbit because a smaller 
initial velocity is required. If we assume that the acceleration of 
the gravitational force at the heights at which we desire the satel- 
lite to orbit is appoximately the same as at the Larth’s surface, 
then the law of mechanics for artificial planets will have the form 
mg = ma, and since the satellite moves in a circle, the centripetal 
force is a = = . Thus, the velocity of the rotating satellite is v= 
=VgR, i.e., 8 km/sec. When such a velocity is imparted to 
a rocket, it is transformed into an Earth satellite. From the above 


table, we see that the value of aT required for imparting a velocity 
of 8 km/sec to a rocket is 54. 


Example of motion of a body with variable mass. Consider a water droplet 
falling in an atmosphere saturated with water vapour. At the instant of time 2z, 
the droplet has a mass m and a radius r. During the time dé, the volume of the 
droplet, and hence the mass (for a density equal to 1), increases by 4zr? dr. 


‘ : . o a : gies 
Thus, the rate of increase in mass is Se = far® =: At the same time, it is 
, d : 
clear from physical considerations that a , the rate of condensation of the 


water vapour, must be proportional to 4ar?, the condensation surface. Hence, 
a = const and r= kt, where k is some constant of proportionality. 

Let us derive the equation of motion of this droplet in the Earth’s gravi- 
tational field. We are interested in the change-of momentum d (mv), which 
according to the fundamental law of mechanics is equal to Fd, where F = mg. 
Thus: 


d : dv dm 
F= (mv), i. e., mg =m —— -F or ra 
Substituting the expressions for m and r, we obtain 
Leer ea 
ae e 
By integrating this equation, we arrive at the following result: v = & t, i.e., 


the droplet falls with the constant acceleration F ga 245 em/sec?. The resist- 
ance of the air was not taken into consideration. 


CHAPTER IV 


ROTATION OF A RIGID BODY 


18. Kinetic Energy of Rotation 


In this chapter, we shall be concerned with “perfectly rigid” 
bodies. This means that we may neglect any deformation occur- 
ring during the motion of such a body, and assume that the dis- 
tances between the particles of the body remain unchanged. 

Let us consider a rigid body rotating about a fixed axis passing 
through it (Fig. 25). We can con- 
ceive of the hody as consisting of 
small volumes of masses Am, Amo, ... 
at distances r;, Ts,... from the axis of 
rotation. Corresponding to the various 
values of the distances are the various 
velocities of motion 1,, Us, ... We are 
interested in the kinetic energy of rotation 
of the entire rigid body, which is com- 
posed of the kinetic energies of the indi- 
vidual particles Am,, Ams, ..., i.e., 


= am ere 4 ants eae 

The velocity of angular motion of any 
point of the body can be easily expressed 
in terms of w, the angular velocity of the 
Fig. 25 rotating body. If the body turns through 
an angle dp in the time dt, the deriva- 


. dg . ; 
tive a is called the angular velocity: 
_ dp 
— “dt . 


For the case of uniform motion, the above formula is transformed 
into a relation already known to the reader, namely, ont, The 


quantity is usually measured in radians per second. If the body 
performs 41 revolution per second, its angular velocity is equal to 
2n rads/sec. 

Different points of a rotating rigid body have different veloci- 
ties v (called linear velocities), but the angular velocity w is the 
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same for each point. In turning through an angle dq, a point de- 
scribes an arc ds =rdg. Dividing both members of this equation 
by the time of motion dt, we obtain a relationship between linear 
and angular velocity: 


v=0or. 


Thus, the formula already known for uniform motion is valid in 
the general case. 

Using this relation, the expression for K,.; may be written in 
the following form: 


Krot == (r7Am,-b r3Amyg+...). 


The quantity in brackets does not depend on the velocity of motion, 
but is a measure of the inertial properties of a body executing 
rotational motion. The greater the value of the expression in brack- 
ets, the greater the energy that must be expended to achieve 
a given velocity. Therefore, the quantity 

T= rjAm,+ryAm-+... 


is called the moment of inertia of the body, and the term r?Am is 
the moment of inertia of a particle. The quantity J may be ex- 
pressed more briefly as follows: 


l= \ redm, 


where the integration (summation) encompasses all the particles 
of the body. ; 
The formula for a body’s kinetic energy of rotation acquires the form 


2 
Kyrot = ca 

This formula is valid for a body rotating about a fixed axis. For 
a rolling body (a ball, wheel, etc.), the energy of motion will consist 
of the energy of rotational and translational motion. Thus, if 
a rolling body has a mass M, moment of inertia J, translational 
velocity v and rotational velocity w, the kinetic energy is 


Mv2 , To2- 
Kron = 7° 


Moreover, it turns out that this formula is valid for any arbitrary 
motion of a rigid body. In theoretical mechanics, it is shown that 
any arbitrary motion can always be resolved into translational and 
rotational motion. The rotation, in this case, must be considered 
with respect to an-axis passing through the centre of mass. 
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19. Moment of Inertia 


If we carefully examine the formula for moment of inertia, we 
see that the value of J depends on the nature of the distribution 
of the mass with respect to the axis of ro- 
tation. The particles that are far from the 
axis of rotation contribute considerably 
more to the total value than those that are 
close to it. 
Let us calculate the moment of inertia of 
a flat disk, of radius r, relative to the axis 
y perpendicular to the plane of the disk and 
passing through its centre (Fig. 26). The mass 
of an annular element of radius z is dm =: 
= 9 X 2nxdzx, where p is the density of 
Fig. 26 the disk’s material. This ring has a mo- 
ment of inertia dJ, = dmz?, and the mo- 
ment of inertia of the entire disk is: 
r 
rj mr® 


‘ 
1,=) dt,=(p x 2x x ade = 2p = 
0 


. 2 


It is evident that with respect to the same axis the moment of 
inertia of a ring is J, = mr®, i.e., J, = 22,, when the entire mass 
is concentrated at the outer circumference. 

The moment of inertia of a body will vary in accordance with 
the location of the axis of rotation. If a thin rod rotates about its 
long axis, the moment of inertia will be very small, for all the parti- 
cles lie very close to the axis of rotation and,. therefore, all the 
quantities ri, r3, ... entering in the formula for J will have very 
small values. The moment of inertia will be much larger if the rod 
is rotated about a line perpendicular to its axis. 

The moment of inertia depends on the ori- 
entation of the axis and the location of the 
point through which it passes. If no specific 
stipulation to the contrary is made, it is assumed 
that the axis of rotation passes through 
the body’s centre of mass. 

If the axis of rotation is displaced relative to 
the centre of mass by the amount a (Fig. 27), 
I, the new moment of inertia, will differ from 
Io, the moment of inertia with respect to the 
parallel axis passing through the centre of mass. 

In view of what was stated at the end of 
the previous article, we can express the kinetic 
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energy of a body rotating about the displaced axis as the sum 


> Mv? @2 
alae Maa BER 


where v is the velocity of motion of the centre of mass and is equal 
to aw. Thus, 


_ Ma®o? , Iw? 


Ka 45 => (lo+ Ma’). 


Hence, the moment of inertia J with respect to a parallel axis, 
displaced by a distance a from the centre of mass, can be expressed 
as follows: 


I=I)+ Ma’. 


It follows that the moment of inertia with respect to an axis pass- 
ing through the centre of mass is always the smallest possible for 
a given orientation. Depending on its symmetry, a body will have 
one, two or three moments of inertia with respect to the main axes 
passing through the centre of mass. 

Thus, a disk is characterised by two axes passing through its 
centre—one lying in the’plane of the disk and the other perpen- 
mr2 «mre 

eae B 


dicular to the disk. The moments of inertia are then ae 
respectively (it is assumed, naturally, that the distribution of mass 
throughout the disk is uniform). For a ring, the moment of inertia 


Peg -. mre 3 
about similar axes is — and mr, respectively. 


For all solids of revolution, it is sufficient to know the moments 
of inertia with respect to two axes. In the case of a body of arbitrary 
form, to completely describe the inertial properties of the body 
during rotation, it suffices to know three moments of inertia with 
respect to axes passing through the centre of mass, namely, Jmax— 
the largest moment of inertia, J,,;,—the smallest, and Imean—the 
moment of inertia with respect to an axis perpendicular to the 
first two. 

The only body for which the moment of inertia about all the 


, « . 2 
axes is the same is a sphere. For a sphere, [J = = mr, 


The above formulas for moment of inertia are calculated from 
the relation: 


I= \ r2dm. 


To use this formula, it is generally necessary to be able to operate 
with multiple integrals. Examples of such calculations are given 
in courses on theoretical mechanics. 
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As we shall see below, physicists are sometimes interested in 
the values of moments of inertia for molecules. Since the mass of 
atoms is concentrated in nuclei whose dimensions are very small, 
the calculation of the moments of inertia can be accomplished 
without difficulty, for the atoms may be considered as point masses. 

For a diatomic molecule, the moment of inertia with respect to 
the axis passing through the atoms is equal to zero. For the axis 
perpendicular to the line joining the atoms we obtain 


. 


T= myra+ msrp, 


where r4 and rg, are the distances of atoms A and B of a diatomic 
molecule to the centre of mass. If l is the distance between atoms, 


r m 
rat Tg =l and ra ee Therefore, 


MaMgB P 
mya+ma 


The moments of inertia of more complex molecules may also be 
calculated as the sum of the moments of inertia of the atoms con- 
sidered as point masses. 


Examples. 1. The flywheel of a ship’s engine has a mass of about 1 ton, 
a diameter of 2 metres and, therefore, a moment of inertia J ~ 1,000 kg m?. 
Making 300 rpm, the flywheel possesses a kinetic energy of rotation 


Iw? 


Kaa 


= 500,000 joules = 50,000 kg-m. 


2. The moment of inertia of the Earth is about 1045 gm cm? = 1038 kg m?. 
The kinetic energy of rotation of the Earth about its axis is 2.5 < 1029 joules. 

3. In a molecule of hydrogen Hy, the distance 1 = 0.753 X 107-8 cm, the 
mass of the hydrogen atom my = 1.6998 X 10-%4 gm and, therefore, the moment 
of inertia of the molecule with respect to the axis perpendicular to 1 is 


— myy,12 
i 2 


=0.46x 10-40. gm cm? 


\ 


20. Rotational Work and the Fundamental Equation 
of Rotation 


If a body fixed on a shaft is made to rotate by a force F or, on 
the other hand, if a rotating body is braked by the force F, the 
kinetic energy of rotation increases or decreases by the magnitude 
of the expended work. Just as in the case of translational motion, 
this work depends on the effective forces and the displacement 
produced thereby. However, the displacement is now angular, 
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and the expression that we know for the displacement of a particle 
by a certain distance is not applicable here. - 

To find the formula that we are interested in, let us refer to 
Fig. 28. The force F is applied at a point located at a distance r 
from the axis of rotation. The angle between the direction of the 
force and the radius vector is designated by 0. Since the body is 
perfectly rigid, the work of this force (even though applied at one 
point) is equal to the work expended in rotating the entire body. 
In rotating the body through 
an angle dg, the point of ap- 
plication traverses the path 
rd and the work dA, equal to 
the product of the projected 
force along the direction of 
displacement and the magni- 
tude of the displacement, is 
then 


dA= Fr sin ®@ dg. 


Fr sin 90 is known as the 
moment of force or torque: M = Fr sin ®. From the diagram, it is 
seen thatrsin 0 =d, where dis the shortest distance between the 
line of action of the force and the axis of rotation. Hence, 


M = Fd, 


j.e., the torque is equal to the product of the force and the lever arm. 
The formula for work that we have sought is 


dA=Mdpg. 


The work of rotating a body is equal to the product of the effective 
torque and the angle of rotation. 

Strictly speaking, the formula is only valid for an infinitely small 
angle dp. However, we may use it in any case if we understand 
MM to mean the average value of the torque for the time of rota- 
tion. Then, ; 


AA= M,y,Ag. 


The work of rotation goes to increase the kinetic energy of rota- 
tion. Hence, the following equation must hold: 


Mdg=a (=> Js 


jf the moment of inertia is constant for the time of motion, then 
Mdo=Iodo — 
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A d 
or, since o=—-, 


dw 
M => I an . 


This is the fundamental equation of motion for a rotating body. 
The torque acting on a body is equal to the product of the moment 


5 é - d 
of inertia and the angular acceleration a é 


Examples. 1. The torque on a wheel of a locomotive developing a traction 
of about 10,000 kg is about 300 kg-m ~ 3,000 newton X metres. 

A man riding a bicycle produces a torque of about 100 newton X metres 
on the pedals. 

2. By means of an example, we shall show the connection between the 
expression for the kinetic energy of a moving rigid body (see p. 75) and the 
fundamental law of mechanics. 

Let us consider a spool of mass m and radius 
r, possessing a moment of inertia / with respect 
to its axis and wound with weightless thread 
(Fig. 29). The free end of the thread is fastened 
at a certain height above the Earth’s surface. 
The spool is allowed to fall under the action of 
its own weight mg. Hence, the equations of mo- 
tion for the spool are: 

du 


mg—T =m —- 


and 


where 7 is the tension of the thread and wo is the 
angular velocity of rotation of the spool. Elimi- 
nating 7, we obtain for the acccleration: 
BAO cas oe 
ae ie 
mg 1+ 


mr2 
Fig. 29 


If time is counted from the moment the spool be- 

gins to fall, then in ¢ seconds the spool will fall a 
2 . 

distance h =< . It is evident that the total kinetic energy of the spool at 

that instant is equal to the change in the potential energy of the spool: 


v2 
K=mgh= mg a 


Substituting the expression for a, we obtain 


21. Angular Momentum to 


21. Angular Momentum 


The similarity between the formulas of motion of a particle and 
the derived laws for the rotation of a rigid body is immediately 
evident. Thus, compare the following formulas: 


Particle Rotating body 
: d@ 
F= ar M=I—=- 
ie mov? k _ Jo? 


Clearly, the physical concepts are also analogous. While in the 
mechanics of particles the acceleration is determined by the force, 
in rotational motion the angular acceleration is determined by the 
moment of force, i.e., the torque. The role of mass is played by 
the moment of inertia, which in rotation is the measure of a body’s 
inertia (the mass alone is insufficient here for this purpose). This 
similarity encourages us to go a step further and assume that anal- 
ogous physical quantities are related by analogous relations. 

In the previous chapter, it was established that the momentum 
p=mv is a physical quantity satisfying the law of conservation in 
a closed system. The quantity analogous to p is the moment of 
momentum (angular momentum): 

N=Io. ; 
It can be rigorously proved that angular momentum satisfies the 
law of conservation, i.e., in a closed system, the total angular 
momentum of the bodies belonging to this system does not change. 
An increase in the angular momentum of one of the bodies is com- 
pensated for by an equivalent decrease in the others. 

The relation. 

Tyo, + Le@_ + LW3-+ ... = const 
has many interesting applications that are in many ways analogous 
to the problems studied in the previous chapter. 

The law of conservation of momentum when applied to a single 
body has the form mv = const and is, therefore, identical with the 
law of inertia. Even in this simple case, the law of conservation of 
angular momentum leads to an interesting result. A single body, in 
the absence of interaction with its medium, must satisfy the condition 

Jw = const. 
However, the moment of inertia of a body may change during 
motion. It is, therefore, evident that an increase in J must be accom- 
panied by a decrease in w, and vice versa. 

One can cite numerous examples, and this phenomenon can be 
strikingly demonstrated by means of a swivel stool. Holding a pair 


6—1409 
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of dumb-bells in your hands, be seated on such a stool (Fig. 30). 
Place your arms outstretched in a horizontal position and have 
someone give you a small rotatory push. Motion takes place for the 


, Fig. 30 


particular moment of inertia J at an angular velocity w. Now,, fold 
your arms on your chest. As a result the moment of inertia drops 


Fig. 34 


sharply to I’. Since the product Jw remains unchanged, Jo = J’o’. 
Thus, changing the position of one’s arms leads to a considerable 
increase in the velocity of rotation. The process may be repeated— 
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stretching one’s arms out leads to retarded motion and folding 
them produces accelerated motion. 

Decreasing the moment of inertia as a method of increasing the 
velocity of rotation is quite familiar to gymnasts and dancers. It 
is used in all kinds of jumps, tumbles and spins. Thus, a ballet 
dancer in a position of large moment of inertia will impart velocity 
by changing her posture to a position of small 
moment of inertia (Fig. 34). 

Rotational recoil is usually demonstrated by 
means of the aforementioned swivel stool and 
a wheel fixed on a long axle (Fig. 32). While 
standing on the stool and holding the wheel 
above one’s head, the wheel is twirled by 
means of a sudden movement. As a result, 
the stool rotates in the direction opposite to 
that of the wheel. This is precisely what is 
meant by recoil. /,;, the angular momentum 
of the wheel, is balanced by JW, the angular 
momentum of the stool with the person stand- 
ing on it (the angular momenta have oppo- 
site signs). This is due to the fact that in the 
initial state both the stool and the wheel did 
not rotate, and the total angular momentum 
was equal to zero. 

An inelastic impact was defined above as 
an encounter between two bodies as a result 
of which the bodies move together. Some- 
thing analogous may be demonstrated in the 
case of rotation using the equipment just de- 
scribed. The wheel is made to rotate and is 
then transferred to a person standing on the stool. Thus, the initial 
state is the folowing: the stool and the person standing on it are 
at rest, while the bicycle wheel is rotating with a momentum J,a,. 
Now, the person on the stool takes hold of the wheel. The angular 
momentum J,@; cannot disappear, but it now belongs to the entire 
system. Naturally, the person on the stool and the wheel rotate 
together in the same direction as the wheel was rotating. Clearly, 
Tyo, = (4+J2)@. If before “unification’? the person rotated with 
a velocity w,., the angular momentum to be conserved is J40,+ 
+I.@,. Therefore, . 


Wyte 


This is very similar in form and in content to the expression for 
inelastic impact. 


L401 + Ip@_ = (4 + 2) o or O= 


6* 
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Examples. 1. The flywheel of a ship’s engine has a moment of inertia of 
yme 
sec 
2. A billiard ball whose radius is 2.6 cm has a moment of inertia J = 
= 250 gm cm? and moves with a velocity of 5 m/sec without skimming the 
table. Its angular momentum is then ~50,000 gm cm?/sec = 5 X 1073 kg m*/sec. 
3. The angular momentum of the Earth in rotating about its axis is 
~1034 kg m?/sec. 


1,000 kg m? and at 300 rpm its angular momentum is ~ 30,000 


22. Free Axes of Rotation 


Let us assume that a body has received angular momentum about 
some axis to which the body is fastened. Further, let us assume 
that the fastening is then removed. While the angular momentum 
must be conserved (naturally, neglecting friction), the orientation 
of the body in space may 
change. If this occurs, 
and as a result there is a 
change in the moment of 
inertia, it will be com- 
pensated for by a corre- 
sponding change in the 
angular velocity. 

However, in a num- 
ber of cases the nature 
of the rotation does not 
change. Stable rotation 
a takes place about the 
original axis, just as if the axis of rotation were fixed as before. 
Theory and experiments show that there are two axes passing through 
the centre of mass that may be permanent, free axes of rotation, 
namely, the axis of maximum moment of inertia and the axis of 
minimum moment of inertia. 

If the fixed axis of rotation passes through the centre of mass 
(Fig. 33), but is inclined to the axes of symmetry and, therefore, 
to the afore-mentioned orientations, then after the fastening is 
removed, the body begins to change its orientation with respect to 
the axis of rotation. It can be seen from the figure that the reason 
for the change of orientation is the fact that the centrifugal forces 
form a couple of forces. The body will continue changing its orien- 
tation until the axis of rotation becomes a free axis. 

It can be shown in a number of ways that a freely rotating body 
will keep changing its axis of rotation until the rotation occurs 
about a free axis. Tying bodies of various shapes to one end of 
a string, and attaching the other end to the shaft of a rapidly rotat- 
ing motor, we can transmit rotary motion to a body without having 
a fixed axis of rotation. In Fig. 34, the successive orientations of 


Fig. 33 
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a rotating hoop, chain and match box are shown. The match box 
will begin rotating about its shortest or about its longest edge. 
Theory shows that rotation about the axis having a mean moment 
of inertia will not be stable even if this axis is an axis of symmetry. 


‘ 


¢) 


Pig. 34 


In constructing one of the first turbines, allempts to fix the position of the 
shaft with sufficient accuracy to eliminate the couple of centrifugal forces 
acting on the bearings at a velocity of 30,000 rev/min were unsuccessful. At 
such high velocities, these forces are intolerably great. The problem was solved 
by using a flexible shaft for the turbine wheel. The rotation took place 
about the free axis and the flexible shaft adapted itself to this axis. 

Let us consider this phenomenon in somewhat more detail. We shall desig- 
nate the shift in the centre of gravity of the turbine wheel due to the wheel’s 
asymmetry by a and the amount by which the shaft sags under the action of 
the centrifugal force by A. The shaft sags in the direction of the asymmetry. 
Hence, the expression for the centrifugal force may be written in the form 
4n?n?M (a+ A). This force is balanced by the elastic force kA, where k is 
the stiffness of the shaft. Thus, 


4 


oy 
4n2n2M 


The formula shows that when the number of revolutions per minute 7 is large 
the shaft’s sag A does not increase, but tends to become equal to minus a, the 
measure of the wheel’s asymmetry. This means that when the angular veloc- 
ity of the turbine increases the total displacement of the wheel with the shaft 
from the axis of rotation tends to become equal to zero. Herein lies the adapta- 
bility of the flexible shaft: It can bend, without breaking, by the amount re- 
quired to eliminate the centrifugal force. 

From the above formula, it follows that the condition k/4n2n?M = 1 is 
critical, for the relation shows that the shaft’s sag becomes infinitely large. 
This is the instant of resonance which must be rapidly passed in running the 

pease: 


A=a 


turbine (the external frequency n= a i.e., nm coincides with the 


natural frequency of a turbine wheel of mass M placed on a shaft of stiffness hk; 
see Chapter V). 
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23. The Gyroscope 


The term gyroscope usually denotes a device that can rotate about 
any orientation of its axis. If a gyroscope is rotated and then not 
interfered with, its axis of rotation will remain unchanged as long 
as no forces act on it (Iw, in this case, should not change). 

The action of a force on a gyroscope’s axis of rotation manifests 
itself in a somewhat surprising manner. This may be demonstrated 
using a gyroscope that is balanced by a load in such a manner that 
the axis of the device is horizontal (Fig. 35). The gyroscope is 


Fig. 35 Fig. 36 


rotated in the vertical plane and a load G is placed on the 
horizontal bar. It would seem that the entire right-hand portion, 
i.e., the gyroscope, should move upwards. Indeed, this would be 
the case if the gyroscope were not rotating. Actually, the rotating 
gyroscope begins to move with constant velocity about the vertical 
axis as shown by the dotted line and the arrow. This motion is at 
a right angle to the direction of the applied force. 

This phenomenon, revolving of an axis of rotation about the 
direction of an applied force, is called precession. Everyone is famil- 
iar with the precessional motion of a top. As soon as the axis of 
the top begins to deviate in the least from the vertical, a gravitation- 
al torque begins acting on the top, tending to topple it. A stationary 
top would fall, but a rotating top begins to precess about the verti- 
cal. The axis of the top will then describe a cone whose vertex is at 
the point of support of the top. 

In general, the rotation of a top is even more complex, for nuta- 
tions are superimposed on the precessional motion. These nutations 
are due to small jolts (which are always present) that make the 
top shake (Fig. 36). As a result of the nutation effect, the axis 
describes a cycloidal curve, as shown in the figure, instead of a cir- 
cle. It should be noted, however, that nutational effects are usually 
very weak. 


CHAPTER V 


VIBRATIONS 


24, Small Deviations from Equilibrium 


The motion of a body or particle about an equilibrium position 
is often encountered in nature. Thus, a small load on a string oscil- 
lates back and forth, a spring quivers, and an atom in a crystal 
lattice vibrates. 

If the body or particle on which forces are acting is in a position 
of equilibrium, its potential energy is a minimum and the system is 
in a potential well (Fig. 37). When the deviation ror the equilibri- 
um position is not large, we are 
concerned with only a small portion 
of the potential well. A potential 
curve near the equilibrium position 
can always be approximated by a 
parabola, i.e., it can be written in 
the form U=+ ka*. Here, + kis 
the constant of proportionality. The 


factor + has been introduced for 


convenience and] its purpose will tr oananooea Koy ooneanane VY 


presently become clear. 

The reasoning used in arriving at : 
the above relation is the follow- Fig. 37 
ing: Potential energy is a function 
of the displacement from the equilibrium position. As is well known, 
making the proper assumptions, any function may be expanded in 
a Taylor series for small values of z. The exponent of x increases 
consecutively from term to term: 


IN 


U =ax+ ka? + ba? 4 cat 4 Se 


However, for small z, the terms of higher power may be neglected 
and, if the potential well is symmetrical, the first term vanishes, 
for the potential energies at equal distances to the left and right 
of equilibrium are equal. 

The force acting at a point deviating from the equilibrium posi- 
tion is equal to minus the derivative of the potential energy. Thus, 
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if the energy is expressed by the formula U =+ kx*, then F =—kx. 


The meaning of the negative sign is clear, namely, the force in 
question always restores the body to the equilibrium position and 
is always directed oppositely to the displacement. Consequently, 
the force F = —kz is called the restoring force and the coefficient / 
is sometimes called the restoring force constant. 

What is the nature of the motion under the action of the restoring 
force? Newton’s law, which is written in the form ma = —kz for 
motion near equilibrium, should give us the answer to this question. 

This equation is satisfied if the point undergoes harmonic vibra- 
tion about the equilibrium position, i.e., vibration in accordance 
with the relation 


2m 
z= Acos pos 


where 7 is the period of vibration. 

Let us verify this statement. The velocity of motion of the point 
for the indicated dependence between displacement and time is 
es dz 2nA ei 2m t 
gp Ts ype 
It should be noted that the maximum value for the velocity of 


the vibra onal motion, i.e., the amplitude of the velocity, is 
0A 


Umax = pF - Let us now determine the acceleration by taking the 
derivative of the velocity. We obtain 
4n2 Px 
a= — a Acos-t. 
Substituting the expressions for acceleration and displacement in 
Newton’s law, ma = —kz, we obtain 
4 2m 27 
—m 5 Acos |-t= —kAcos art. 


We see that the factors depending on time cancel out. Hence, the 
equation for harmonic vibrations satisfies Newton’s law for small 
deviations from equilibrium. 

It is noteworthy that Newton’s law places a constraint on the period 
of the vibrations. As can be seen from the last formula, the period of 


the free vibrations about the equilibrium position is T=2n ~ ‘ 
This period is determined by the vibrating system—the restoring 
force constant & and the mass of the particle. It is, therefore, under- 
standable that this period is called the natural or characteristic 
period of the vibrating system. | 
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No restrictions are placed on the amplitude A of the vibrations, 
with the exception, of course, that the deviations from the equi- 
librium position must be small. 


25. Particular Cases of Vibrations 


In view of the fact that we deal with two types of potential ener- 
gy in mechanics, namely, elastic and gravitational, it also becomes 
possible to divide mechanical vibrations into these two cases. 

Bodies vibrating under the action of an elastic force usually 
perform linear vibrations of compression and extension. However, 
torsional vibrations are also encountered. 

If a body suspended from an elastic band, spring or wire is dis- 
placed from the equilibrium position along the band, spring or 
wire axis, linear vibrations arise under the action of the elastic 
restoring force. The coefficient * is, in this case, the stiffness of 
the vibrating body. 


To what extent this coefficient determines the resulting period and fre- 
quency of vibration is seen from the following example. Identical loads, whose 
masses are equal to 1 kg, are suspended from three springs having different 
stiffnesses. Under the action of these loads, the springs are elongated by 1 mm, 
1 cm and 1 metre, respectively. The coefficients of stiffness will then have the 
following values: 


981 x 108 
0.1 


dynes 


> dynes, 6 ; 
CYROS | kg =0.981x 108 SES ; 


key = =0.981 >< 10 


ie o-gntciot 288. 
cm 


The periods and frequencies of the vibrations are: 


am 408 geeks 
T,=2n Ep = an 0.981107 = 8-34 10 sec, vy = 15.8 cps; 


T,=0.2 sec, Ve=5 cps; 
T3=2 sec, v3=0.5 cps. 


For torsional vibrations, the restoration to equilibrium takes 
place under the action of a torsional moment that is directly pro- 
portional to the angular displacement for small deviations from 
equilibrium. If, for example, a massive disk having a moment of 
inertia J is suspended from a wire, and the wire is twisted by some 
angle or other, the equation for the torsional vibrations of the 


disk will be J a= —Dg. The torque D, relative to unit angular 


displacement, corresponds to the restoring force constant, and the 
moment of inertia corresponds to the mass. Thus, the period of free 


90 Vibrations 
torsional vibrations is represented by the formula 


Lan ae 


The greater the moment of inertia, the lower the frequency of the 
vibrations. 


Ezapmie. Assume that a disk faving a mass of 100 gm and a radius of 
5 cm is suspended from a steel wire and that the period of the torsional 


2 

vibrations is 1 second. The moment of inertia of the disk is I,= ie = 
: 4n2 

= 1,250 gm-cm?. Thus, the restoring force constant p= “Tt = 49,400 

Sune xem | If a disk of the same mass but of 1 cm radius is suspended 


from the same wire, the period of the torsional vibrations will no longer 


be 1 sec, for T2=2n V x02 sec. 

A body oscillating under the action of gravitational force con- 
stitutes a pendulum. If the pendulum may be approximately repre- 
sented as a point mass suspended from a weight- 
less wire, we call it a mathematical pendu- 
lum (Fig. 38). 

From the figure, it is easily seen that the 
expression for the restoring force is mg sin a, 
i.e., the component of the weight along the. 
tangent to the path. If the deviation from 
equilibrium is small, the sine of the angle may 
be replaced by the value of the angle a or by 
the quotient obtained when the displacement 
x is divided by the wire length J. In this ap- 
proximation, displacement along the chord is 
assumed to coincide with displacement along 
the arc. Thus, the restoring force is equal to 


mg > and the restoring force constant is equal 


me 
L — 


, 
mass of the bob cancels out and 7 = 2n V Ze : 


The fact that the period of a pendulum does 

Fig. 38 not depend on the mass is an example of a com- 

mon feature of particle motion in a gravitation- 

‘al field. Since according to the law of gravitation the force 
acting on such a particle is proportional to the mass, the mass can- 
cels out in the equation of motion. Thus, we have arrived at the 


to . In the expression for the period, the 
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well-known result that, for a given location in a gravitational field, 
the period of a mathematical pendulum depends only on its length. 

The measurement of the period of a pendulum may be used to 
determine g. The value of this measurement may be determined 
extremely accurately so that very minute variations in the value 
of g may be ascertained. Various methods of determining the Earth's 
shape and various gravimetric investigations are based on this 
measurement. (Small changes in the value of g, which, however, 
greatly exceed the limits of experimental error, may occur due to 
scams of various density below the Earth's 
surface.) 

When the small oscillations of a physical body 
cannot be approximated by a point mass, the pen- 
dulum is called a physical pendulum. Fig. 39 shows 
arigid body whose axis of rotation (oscillation) 
passes through it. The period of the physical 
pendulum is calculated by the same formula as 
for torsional vibrations: 


T= 2n ee 


since the equation 


dw, ‘ ; 
fag = Oe Fig. 39; 


is valid for the motion of any body rotating about an axis. However, 
in the case of the gravitational field, we can easily express the torque 
relative to unit angular displacement by a more direct pendulum 
characteristic. From the same figure, it can be seen that the torque 
is equal to mgr sin a, i.e., the product of the weight of the body, the 
distance r from the centre of gravity to the point of suspension, and 
the sine of the angle of deviation from the equilibrium position. 
Since the deviation from the equilibrium position is assumed to be 
small—as always in this section—we obtain the expression mgr a 


mgr & 
a 


for the torque; whence, D = = mgr. Thus, the period of 


a physical pendulum is given by 
T=2n ot OR V ae 
mgr g 


The quantity l’ = a is called the equivalent length of the phys- 


ical pendulum. This is the length that a mathematical pendulum 
would have for such a period. 
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26. Transformation of Energy. Damped Vibrations 


If there is no friction, the total energy ¢« of a body naturally 
remains unchanged for vibrations about its equilibrium position. 
Since potential energy is usually expressed relative to an arbitrary 
level, we shall assume that the potential energy in the equilibrium 
position (displacement z = 0) is equal to 
zero. At any instant of motion, 
mv2 kx? 

2 ss ls 
In the equilibrium position, the kinetic 
energy is a maximum. In the end po- 
sitions, the body comes to a standstill 
(v= 0 and xz=A) and the potential 
energy is a maximum. It is evident from 

this, incidentally, that 

kA2 
=a 
Fig. 40 i.e., the vibrational energy is propor- 
tional to the square of the amplitude. 


E= 


For the three springs considered in the example on p. 89 assuming the 
amplitudes of the oscillations are the same, i.e., A = 0.1 cm, the total vibra- 
tional energy will have, respectively, the following values: 


€;=0.49x 105 ergs; e¢2=0.49104 ergs; e3= 49 ergs. 


This discussion has not taken into account the frictional force, 
which, as a rule, is experienced by all vibrating bodies. Such ideal 
vibrations will continue for ever without change in amplitude. 
Friction, however, produces damped vibrations. Formally, in this 
case too, it is possible to write the displacement equation in the form 


xz=Acoswt, 


but A is understood to decrease with time (Fig. 40). To determine 
how A depends on time, the frictional force must be known, i.e., 
fy must be known for every instant of time during which vibrations 
occur. A simplifying assumption, more or less satisfied in practice, 
is that the frictional force is proportional to the velocity of motion: 


ftp =v, 
where the coefficient a is known as the resistance constant. 
For a ball having a radius of 0.53 mm, the resistance constant a at about 


15°C is 13.93 gm/sec in glycerine, 0.35 gm/sec in sulphuric acid and 0.01 gm/sec 
in water. 
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The energy equation can now be written in the form 
de= —avdz 


and the vibrating particle continuously loses an amount of energy 
equal to the work of the resisting force. Hence, the equation of 
motion is written as follows: 


3 


ma= —kx—av. 


By substitution, it is not difficult to show that this equation is 
satisfied by the equation z = A cos wt when the amplitude A de- 


creases exponentially with time: 
a 


A=Ae 2”, 


Here, A, is the amplitude at the instant of time ¢ = 0. 

It should be noted that the ratio of two successive amplitudes 
is a constant. Thus, the expressions for the amplitude after n—1 
and n periods, respectively, are 


(n nT 


——*_(n-1)T ee 
An-1 => Ape-» ,2m and An => Aye 2m 
Let us divide the former relation by the latter. The ratio 
a 
ce eee 
An 
does not, in fact, depend on n. The rate of damping is sometimes 
expressed by the logarithmic decrement 8: 


= An-; _ a 
6=In An 2m r. 


Thus, the damping is greater, the greater the resistance constant, 
the smaller the mass, and the greater the period. 

It should be noted that the period of damped vibrations differs 
from the period of free vibrations. The same calculation that leads 
to the formula for the time dependence of the amplitude also yields 
the following relation for the period: 


Tats 


This means that, for small resistance, 7 differs little from Ty) = 


m ois be . ° 
= 20 V r: When the resistance increases the period increases and, 
finally, for 
a2 


= 


4mk 
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vibrations cease. We say, in this case, that the body displaced from 
the equilibrium position returns aperiodically to this position. 

Here are some approximate values for the logarithmic damping 
decrement of certain vibrating systems: 


Acoustical vibrating systems ..... 0.: 
Electrical oscillation circuits . . . . 0.02-0.05 
Tuning fork. .°. 2. 2. ee ee ee 10-3 
Quartz crystal ............ {0-4-10-5 


Let us consider several examples of damped vibrations. 
a) Vibrating tuning fork. Logarithmic decrement b= T = 107%. 
Assume that the period of the vibrating tuning fork is T = 0.01 sec. Then, 


5 =04 sec-1. This means that during the time 2m 40 sec the amplitude 
of the vibrations decreases by the factor e: i 


a 
-~—! 
Ap=Age 7" ; At—19 = Age7!. 
The quantity sali = Tt is called the time constant of the given vibrating system. 


b) In acoustical vibrating systems, as can be seen from the above table, 
the logarithmic damping decrement is large. This means that the vibrations 


are rapidly damped. If 6 = = T =0.1, the amplitude of the tenth vibration, 


Ajo, Will already be less than the initial amplitude Ao by the factor e. 
Thus, 


c) The change in the period of damped vibrations may be conveniently illus- 
trated by means of a spring. Let a load having a mass m= 50 gm be suspend- 
ed from a steel spring, which is thereby elongated by 2 cm. Thus, the stiffness 
of the spring is k = 24,500 dynes/cm. If there were no damping, 


To= 20 / 20.28 sec. 


am 
Assume that the damping is such that the time constant t, = —— = 9 sec, 
i.e., the resistance constant a = 20 gm/sec. The period of the vibrations then 
becomes 


T= —20_ 74 (1 4.4.08 x 10-5). 


a2 
Vi- 4mk 


Let us now immerse this pendulum in liquid. Assuming the time constant in 
this case to be to = 41sec, the amplitude of the fourth vibration will already 
be 1/e of the initial amplitude, i.e., there is considerable damping: 


T, & Ty (14102 x 10-5) & 1.0017. 
Thus, even in this case the period increases by only 0.1 per cent. 
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27. Forced Vibrations 


If a body is displaced from its equilibrium position and then 
not interfered with, the vibrations occur at the natural frequency 
of the body, independent of the nature of the excitation, i.e., the 
vibrations are determined only by the properties of the system. 
The frequency of the vibrations of a string remain the same regard- 
less whether the sound was made by the string being plucked or 
struck. 

At the same time, a number of means exist of “locking” the vibra- 
tions of a body to an external frequency. Such forced vibrations 


| 


| 
] 
| 
] 
| 
| 
| 
: (Fy) 
f Wy 


Fig. 41 Fig. 42 


may take place if two bodies capable of vibrating are coupled. One 
of the bodies will force the other to vibrate. A motor that is improp- 
erly balanced will execute vibrations that are transmitted to the 
foundation, i.e., the foundation will execute forced vibrations. 
We can perform the following experiment: A pocket watch is placed 
in a small box and suspended by three strings. As a result, the box 
passes into a state of forced vibration. In Fig. 44, a device is shown in 
which a rotating eccentric makes a pendulum pass into a state of 
forced vibration. In all these cases, a periodic force varying with 
some frequency acts on a body. Such a force is aptly called an 
external force. 

Forced vibrations do not set in immediately. A certain amount 
of time must elapse before the body coupled to the vibrating system 
begins to vibrate. Eventually, a particular amplitude is reached 
and the frequency of the vibrations will be exactly equal to a. 

The fact that a body has a natural frequency of vibration wo 
nevertheless affects the phenomenon of forced vibrations. To be 
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more exact, as we shall directly see, the natural frequency and 
the external frequency differ significantly. Fig. 42 shows the 


dependence of the amplitude of forced vibrations on the ratio on 


for three systems having different amounts of friction. When the 
external frequency and the natural frequency coincide, the amplitude 
of the forced vibrations is a maximum. This phenomenon is widely 
known as resonance. 

The curves shown in Fig. 42 may be determined theoretically. 
The equation of motion of a body executing forced vibrations, 
under the action of a periodic external force /’y) cos wt, has the form 


ma= —kx—av-+F)cos at. 
By substitution, one can easily show that the displacement of 
a vibrating point will satisfy the equation 


x= Acos(ot +f), 
where the amplitude 
Bas re ot eS 
Vink (0 — 02)? -- ea? 


and the phase shift B satisfies the equation 


—aw 
tan B= m (wXR—w?) ° 
F i dex dz i 
Taking into account that a= TE and v = ae let us substitute these 


values in the equation of motion. After simple conversion and grouping terms 
containing cos wt and sin wt, we obtain 


[(— mw2-L k) A cos B—awA sin B— Fo] cos wt — 
—[(—mw? + k) Asin B-+a04 cos B] sin wf =0. 


Since the obtained equation must be valid for every instant of time, the coeffi- 
cients of cos wt and sin wt must be equal to zero. Thus, we obtain two equa~ 
tions for determining A and B 


[(—mw?2-+ k) cos B—aw sin B] A= Fp 
[(—mw?- k) sin B + aw cos B] A=0. 
Squaring both equations and adding, we obtain 
Fo 


A= Ol ———————————— 
V m2 (08 — 2)2Lakn? | 


Kk 
where @) = y= is the frequency of the natural vibrations. From the second 
equation, we obtain the phase shift f: 
: — ao 


On P= in (op OF 
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From the first formula it follows that the amplitude A depends 
on w as follows: When w<po the amplitude increases as w increases; 
when @ = @) the amplitude reaches a maximum; and when 
© > Wp the amplitude decreases as w increases. This effect (sharp- 
ness of resonance) is more pronounced, the smaller the resistance 
constant a. When there is little friction, the resonance disrupts 
the system, for at «=O the resonance amplitude goes to infinity. 
Engineers must take this into account in their calculations. 
To design a structure so that it is in- 
sensitive to the vibrations of its foun- 
dation, a resonance curve similar to 
the one shown in Fig. 43 must be 
available. The lower curve shows the 
vibrations of the foundation and the 
upper one, of the structure. At reso- 
nance, which occurs when the period 
of the vibrations is 0.32 sec, the amp- 
litudes reach a value of 20-25 microns. 
This, in general, isno small amount. 

The sharpness of resonance is an in- 
dicator of still another important 
phenomenon, namely, the sharper the 
resonance, the slower vibrations of 
constant amplitude set in. 

Another feature of forced vibrations 
is the presence of phase shift. Until 0 
now, we have assumed that the origin 030 a40 
of the coordinate system was so select- Pertod (sec) 
ed that with respect to t=0 the maxi- ; 
mum displacement is in the positive Fig. 43 
direction. Naturally, if we are consid- 
ering only one vibration, there is no need to select any other ori- 
gin. However, if we are comparing two vibrations and pick the ori- 
gin so that x = A when ¢ = 0, then the displacement of the other 
vibration at this particular instant may have an arbitrary value. 
This circumstance may be taken into consideration by introducing 
the phase shift B in the argument of the cosine. Thus, if x = 
= Acos (wt + B), then z=A cos f at the instantof time t=0. The 
phase displacement is uniquely described by means of the phase 
shift B. 

ee now return to resonance phenomena. The quantity B in 
the formula for forced vibration ‘indicates that the phase of the 
forced vibration, generally speaking, is shifted with respect to the 
Phase of the impressed vibration. The magnitude of the phase 


Amplitude 
= 2 8 


z 
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shift depends on = , the ratio of the natural frequency to the exter- 


nal frequency, and also on the damping. Fig. 44 shows that a 90°- 
phase shift occurs at the resonance frequency, independent of the 
damping. The effect of the damping becomes clear when the situa- 


Phase LAG 
NT ES) 


ao at 
Fig. 44 


_tion somewhat removed from the resonance condition is considered. 
For weak damping (small logarithmic decrement 6), at frequencies 


Fig. 45 


somewhat below resonance, the phase shift is almost zero, while 
at frequencies somewhat above resonance, the phase shift is almost 
480°. The same tendency exists for heavy damping, but it is not 
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so pronounced. For a small amount of friction, one can say that 
a 180°-phase shift occurs when the frequency passes through the 
resonance condition. 

A simple experiment (Fig. 45) will demonstrate the essence of 
these interesting relationships. Suspend a weight by a string and 
allow it to swing freely. When the period of the free vibrations of 
this pendulum is manifested, stop the pendulum and by periodic 
motion of the hand bring it into a state of forced vibration. 
At first move the hand rapidly, so that the period of the natural vibra- 
tions is greater than the period of the forced vibrations; then move 
it slowly, so that the period of the natural vibrations is less than 
the period of the forced vibrations. It will be seen that in the first 
case the pendulum and the hand are 180° out of phase, while in 
the second case they are in’ phase. 


Let us again consider thespring on page 94, which has a damping constant 
a= 20 gm/sec, a mass m= 50 gm and a period 7) = 0.28 sec (Wp = 
= 22.4 sec"). When an external sinusoidal force of frequency @ = Wo acts 
on the spring, the amplitude of the forced vibrations is equal to A = 4 cm 
when the amplitude of the impressed force is Fo = 1,790 dynes ~1.8 gm. 
A deviation of the frequency of the impressed force from Wo results in a change 
of the amplitude of the forced vibrations and a change in the phase shift B 
between the vibrations of the spring and the external force. The table shows 
the data obtained for various deviations by means of the formulas derived 
in this section. 


Freq. of eternal force SO tniee a Cony: Phase angle — B (degrees) 
2 3.58 0°05’ 
10 3.95 0°35’ 
15 4.48 0°15’ 
22.4 4.04 90° 
30 2.48 178°50’ 
40 1.31 179°10’ 


It is seen that in the presence of damping themaximum amplitude of the forced 
vibrations is reached when the frequency of the impressed force is some- 
what less than the natural frequency of the vibrations. The weaker the damp- 
ing, the smaller this shift in frequency. 


28. Self-Sustained Vibrations 


Fig. 46a shows a trough of triangular cross-section fixed on 
a shaft about which it can rotate. The trough has some particular 
period of free oscillations, which may be observed by swinging the 
trough away from its equilibrium position. The oscillations will 
continue as long as friction and air resistance do not stop them. 


7* 
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Let us place the trough under a water faucet and allow the stream 
of water to flow evenly on the wall of the trough, at a point some- 
what removed from the centre line. It is not difficult to envisage 
what will ensue. As more and more water pours into the trough, 
the height of the centre of gravity rises until, finally, it exceeds 
the height of the shaft to which the trough is fixed. The pressure 
of the stream of water is now sufficient to upset the trough; where- 
upon, water flows out and the 
trough returns to its original 
position. This cycle keeps re- 
peating as long as the stream 
of water continues to flow. 
Thus, the trough will oscil- 
late. However, the character 
of the oscillations produced in 
this manner is quite different 
from the oscillations consid- 
ered above. 

In the first place, it is im- 
portant to note that the exter- 
nal force is not of an oscillatory 
nature; i.e., it is a constant 
force (the pressure of a stream 
of water). Secondly, such a 
system executes undamped 

time ——> oscillations, although subject 

0) to the action of friction and 

other resistance. And finally, 

Fig. 46 the resulting oscillations are 

not harmonic, i.e., they do not 

have asinusoidal shape. Thus, 

in our example, the similarity to a sinusoid is nil. By conducting 

such an experiment, it can be shown that the dependence of the 

amount of water in the trough on the time may be represented by a 
saw-toothed curve similar to that shown in Fig. 46). 

The oscillations described above may be classified as self-sustained 
oscillations. Such oscillations constitute a distinct phenomenon, 
basically differing from free, undamped oscillations occurring with- 
out the action of a force, as well as from forced oscillations occurring 
under the action of a periodic force. The above example may appear 
to be artificial. However, self-sustained systems have broad 
application and are very often encountered wherever mechanical 
and other oscillations occur. 

A simple pendulum clock (Fig. 47) executes self-sustained oscil- 
lations. As is well known, such a clock is actuated by a falling 
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weight suspended from a chain that passes over a gear wheel. This 
wheel is located on the same axis as a balance wheel, which can 
mesh with a symmetrical anchor escapement. A pendulum is rigid- 
ly fixed to the escapement. At the instants when the balance wheel, 
which is driven by the gear wheel via a gear drive, touches the 
pallets of the escapement with its teeth, the pendulum is given 
an impulse. The rest of the time the pendulum and the escapement 
swing freely, while the balance wheel moves by itself. The escape- 
ment and the balance wheel are so constructed that the pendulum 
obtains two impulses each time the 
balance wheel advances by one 
tooth. One impulse is obtained when 
the pendulum moves from left to 
right, and the other when it moves 
from right to left. 

The self-sustained vibrations of 
a clock are basically similar to 
those of the trough of triangular 
cross-section. The vibrations occur 
under the action of a constant rather 
than a periodic force, are undamped 
in spite of the presence of friction, Fig. 47 
and are not harmonic. 

In the above examples, a common property of self-sustained 
vibrations is manifested, namely, the property known as feedback. 
A pendulum executes undamped oscillations and causes a mechanism 
to give an impulse at appropriate moments. The mechanism pushes 
the pendulum and the pendulum provides feedback to the mechanism. 
If the pendulum stops, the impulses also cease. The oscillations of 
the pendulum are governed by the pendulum itself. 

In exactly the same manner, the swings of the triangular trough 
ave governed by the trough. The stream of water regulates the 
swinging of the trough, while the construction of the trough itself 
regulates the water inflow. : 

A string struck with the fingers and then released is in a state of 
free vibration. The situation is different when a string is drawn 
with a bow. In this case, the string executes self-sustained vibra- 
tions that are saw-toothed in shape. The bow pulls the string along. 
When the displacement reaches a certain limit, the string separates 
away from the bow, returning to its original position. The bow 
again pulls the string along and the process is repeated. In the 
space of the second that the musician draws the bow, the phenom- 
enon is repeated hundreds of times. These are typical self-sustained 
vibrations since they are due to a continuously acting force. The 
string itself controls the vibrations by its elasticity. 
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The squeaking sounds emanating from door hinges in need of 
oiling also belong to this class of vibrations. 

We say that feedback occurs whenever an instrument or machine 
automatically introduces automatic corrections to its action when 
the operating conditions change. The principle of feedback is one 
of the fundamental concepts in automation. 


29. Addition of Parallel Vibrations 


In a number of cases, the problem arises of analysing the motion 
of a body simultaneously executing two vibrational motions. Thus, 
an oscillating pendulum may 
be located on a vibrating plat- 
form, or it may be on a roll- 
ing ship. 

If we are concerned with 
vibrations in a single direc- 
tion, the addition occurs as 
shown in the model in Fig. 48. 
Two pendulums, in this case, 
oscillate in parallel planes. 
A light rod lies freely on the 
pendulums and a _ recording 
pen is attached at its centre. 
As an approximation, we can 
assume that the pen will re- 
main ina plane differing little from the planes of vibration of the pendu- 
lums and that the displacement of the pen at a given instant will 
be equal to the algebraic sum of the pendulum displacements. Another 
arrangement may also be used, e.g., a ball oscillates on a spring 
suspended from a board and the board, in turn, is attached to a post 
by a spring in such a manner that the ball simultaneously executes 
two different vibrations in a single plane. 

If z, is the displacement of the first vibration in the absence 
of the second, and zx, the displacement of the second vibration in 
the absence of the first, then, at each instant, for simultaneously 
occurring vibrational processes, 


Fig. 48 


L=I%y+2o. 


In the most general case, the component vibrations may differ 
in amplitude, frequency and phase. 

Let us first consider the case when the vibrations have equal 
amplitudes and frequencies, but are displaced in phase. Then, 


x,=Acosoat, %_ = Acos (wt + @) 
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2 2 2 


23 . e . . 
where w=. This means that the resultant vibration is also 


T 
harmonic and has the amplitude 
2A cost : 


Hence, it follows that the amplitudes of vibrations add arithmeti- 
cally when the vibrations coincide in phase and subtract when 


AMA Ae | 


y= 180° 


Fig. 49 


they are opposite in. phase (p = 180°). In the intermediate cases, 
the amplitude assumes a value between zero and 2A. In particu- 
lar, when p=120° the amplitude of the resultant vibration is 
equal to A. This is illustrated in Fig. 49. 

Another important case is the addition of vibrations having 
different frequencies. For simplicity, let us assume that mp = 0 and 
the amplitudes are equal. Then, 


£,=Acosa,t, Zg=ACcoS@t, and 
r= 2Acos-AtO2 cos “t= "2 7, 


In the general case, the vibrational motion obtained when such 
vibrations are added does not exhibit a distinct periodicity with 
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respect to the displacement z. However, two particular cases de- 
serve special consideration. 

First, let us consider two vibrations whose frequencies are close 
to each other. Then, w; — wz < w; + @, and the displacement z is 
the product of two cosines, one varying rapidly with time and the 
other very slowly. Hence, 

2A cos “12 
may be considered to be the slowly varying amplitude of vibra- 
tions occurring with an average frequency Wap = LE The fre- 


quency of the slowly varying amplitude is known as the beat 
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Fig. 50 


frequency. Fig. 50 clearly shows the two frequencies—the basic 
frequency of the vibrations and the beat frequency. 

The second important case is the addition of two vibrations 
when one of the frequencies is a multiple of the other. It is quite 


Fig. 54 


evident that the resultant vibration will be periodic. If, for example, 
the period of one vibration is 3 sec and that of the other 7 sec, the 
resultant vibration will be repeated every 21 sec. This is shown 
in Fig. 54. 
30. Vibration Spectrum 
We have already spoken about vibrations that repeat with preci- 
sion every specific interval of time, but are not harmonic. For 


example, we have considered saw-toothed vibrations. If we are 
sufficiently exacting, it turns out that harmonic vibrations, i-e., 
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those represented by sinusoids, are encountered in’ nature and 
engineering much less often than nonharmonic vibrations. 

At the end of the previous article, we noted that the sum of two 
sinusoids is a periodic vibration, even though not a sinusoid, if 
one of the frequencies is a multiple of the other. Naturally, this is 
true for any number of harmonic vibrations and not merely for two. 

The sum of two vibrations having periods 7 and > 
tively, is a vibration having a period 7. Furthermore, this is the 
period of the vibrations obtained by adding three vibrations having 


T, respec- 


periods Ries T and + T, respectively; also, four vibrations—with 
: : A 1 é ‘ 
the additional vibration having the period 7; five vibrations— 


with the additional vibration having the period = T; etc. Converting 


to frequencies, this may be expressed as follows: The sum of any 
number of vibrations whose frequencies are multiples of w, i.e., the 
frequencies w, 2, 3a, ..., is a vibration having the frequency wo. 
Now, the following question naturally arises: By adding an 
arbitrarily large number of vibrations, whose frequencies are mul- 
tiples of and whose amplitudes are selected as required, is it 
not always possible to secure any desired vibration, even saw-toothed? 
Fourier, the French mathematician, proved that this was 
indeed the case. The theorem named after him states that it is always 
possible to select a;, a2, @3, --- and @1, M2, Ps, --- in such a man- 
ner that any periodic vibration having the frequency may be 
represented in the form of a sum of harmonic vibrations: 


X=, COS (Wt + G4) + dy COS (2F + Pz) + 43 COS (Smt + p3)-+ ... 


The frequency is called the fundamental frequency, and the fre- 
quencies 2m, 3a, ... are the overtones or harmonics (e.g., second 
harmonic, third harmonic, etc.). The closer the curve of the vibra- 
tions approaches a sinusoid, the smaller the amplitude of the harmon- 
ics. On the other hand, if the curve of the vibrations hardly resem- 
bles a sinusoid, the amplitudes of some of the harmonics will not 
differ greatly from the amplitude of the fundamental frequency. 
Representing the vibration in the form of a sum of harmonic 
vibrations is called spectrum analysis. The spectrum consists of the 
data on the frequencies and amplitudes of the harmonic vibrations 
comprising the vibration of frequency w. These data may be pre- 
sented in tabular form. If there are many frequencies, however, we 
usually resort to a graphical means of presentation (Fig. 52). 
The concept of spectrum may be extended to include nonperiodic 
processes. We may speak of the spectrum of elastic vibrations 
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produced by a blow on a table, the spectrum of the report of a gun, 
or the spectrum of an outcry. 

To make this clear, let us first consider a process consisting of 
periodic, damped impulses. This is not the case of a single report 
or outcry, but of a series of reports or outcries, i.e., repeated at 
regular intervals of time. Characteristic of this process is the rapid 
damping of such a vibration, whose curve is shown in Fig. 53a. 
The spectrum of this vibration may be established by existing 
means and has the form shown on the right in the figure. As was to 
be expected, we see that the spectrum is composed of frequencies 
that are multiples of the fundamental. It should be noted that the 
spectrum has a maximum, which occurs for the eighth harmonic. 
This is not accidental, for if we examine the vibrations depicted 
on the left in the figure, we see that within each individual impulse 
the damped pulse vibrates at a “frequency” that is 8 times greater 
than the frequency of the fundamental tone (Fig. 53a). 

Similar impulses are illustrated in Fig. 53b, but in this case 
the frequency is one-half of the above. Compare the spectrum of 
this vibration with the previous one. Since the fundamental fre- 
quency isnow one-half of the original, the “frequency” of the damped, 
elementary process (we have assumed that it has remained the same) 
will now be the 16th harmonic of the fundamental tone. The distri- 
bution of the harmonic amplitudes remains as before, but their 
number in the same interval of frequencies is two times greater. 

It is easy to see now that the spectrum of a nonperiodic proc- 
ess—a single impulse—is continuous. Individual frequencies are 
not discernible (Fig. 53c), but the nature of the spectrum is very 
similar to that considered above. 

A mathematical proof of the above conclusions is contained in 
the theory of Fourier integrals. 


31. Addition of Mutually Perpendicular Vibrations 


To analyse a complex vibration consisting of the sum of two 
mutually perpendicular vibrations, it is best to use an electronic 
oscilloscope. We shall discuss this apparatus in more detail below 
(p. 462). For the present, it is sufficient to note that an oscillo- 
scope enables us to depict the vibrations of an electron beam in two 
mutually perpendicular directions. The trace of an electron beam 
on a fluorescent screen describes a path that is the result of two 
mutually perpendicular vibrational motions of the beam spot. 

Let us assume that the vibration of the beam trace in the ver- 
tical direction is represented by the relation y = b cos (wt-+ 6), and 
in the horizontal direction by the relationz = a cos at. To deter- 
mine the nature of the resultant path, we must eliminate. time from 
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the above equations and obtain an equation of the form/f(z, y) 


ala || 


Writing the expressions-for the displacements in the form 
= cos wat, + = cos (wt-+6) = cos wt cos 6—sin wt sin 6 and 
replacing, in the second equation, cos wt by = and sin wt by 


Vie , we obtain after simple conversion the equation of 
an ellipse rotated with respect to the coordinate axes: 
x2 y2 ary 


ine 
7 ee cos 6 = sin? 6. 


Let us now vary the parameters of the vibrations and see what 
happens to the ellipse. If we vary the phase difference, the ellipse 
will change its form and simultaneously rotate (Fig. 54). 


y y y 
d= 0° S=90° O=180° 
Fig. 54 


When the phase difference is equal to 90° the axes of the ellipse 
coincide with the coordinate axes. If the phase difference is decreased 
or increased, the ellipse 
begins to rotate to the left 
or to the right, respectively, 
and simultaneously contracts. 
When the phase difference is 
reduced to zero, the ellipse 
degenerates into a straight 
» line. The various cases can 
><) be checked by substituting, 
in turn, the values § = 0°, 
90° and 180° in the above 
equation. 

If the amplitudes of the vibrations in the vertical and horizon- 
tal directions are equal, then for phase differences of 90° and 270° 
the path is a circle. There is a difference between these two phase 
differences in spite of the fact that the paths are identical. In one 
case, the beam moves around the circle in a clockwise direction, 
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while in the other, the motion is counterclockwise. To see this, 
let us return to the original equations. We obtain the following: 


for 90° x=acosot, y=bcos (w+ 90°); 
for 270° x=acosat, y=bcos(wt+ 270°). 


The first pair of equations shows that for increasing time, at t = 0, 
the point having the coordinates x = a, y = 0 begins moving in the 
direction of negative y, i.e., clockwise. The second pair of equations 
shows that in this case the direction of motion is counterclockwise. 

In viewing an oscillogram, it will be observed that the ellipses 
do not stand still, but slowly shift as though a continuous change 
in phase were occurring. Careful observation shows that the ellipse 
does not rotate, but the curve being traced by the beam spot seems 
to continuously shift from one ellipse to another. This phenomenon 
occurs when the frequencies of the vibrations differ somewhat. In 
fact, a difference in frequency is entirely equivalent to a continu- 
ously changing phase difference. Let us assume that the frequency 
of the vertical vibration w, is Aw greater than the frequency of 
the horizontal vibration @,. Then, 


Wt + 6 = Wet + (Aw? + 4), 


where the variable phase difference is within the brackets. 

If the frequencies differ considerably from each other, before the 
beam is able to describe the major portion of one ellipse the phase 
has already changed. As a result, the described curves look less 
and less like ellipses. Examples of these queer curves are shown in 
Fig. 55 and are known as Lissajous figures. The depicted curves 
are for a frequency ratio of 3: 4. 


CHAPTER VI 
TRAVELLING WAVES 


32. Propagation of a Disturbance 


Every body is elastic to one or another degree, i.e., every body 
is able to restore itself to its original form after being distorted 
by a force of short duration. This property is responsible for the 
fact that every mechanical action is transmitted by a body with 
finite velocity. If a perfectly rigid rod, incapable of being deformed, 
existed, it would only be able to move as a unit, and the action 


of a force would dissipate in such a body instantaneously. If a per- 
fectly plastic body existed, deforming without in the least restoring 
itself to its original form, it would be incapable of transmitting 
any mechanical action whatsoever. 

In an elastic body a disturbance is transmitted successively 
from one particle of a body to a contiguous one. The compression 
produced at the end of a rod due to the blow of a hammer is prop- 
agated along the body with a definite velocity c. If a bend of short 
duration is created at some point of a rigid body, this disturbance 
will also be transmitted with finite velocity through the body. The 
same is true of every deformation. Propagation through a body 
for various mechanical deformations is usually demonstrated by 
means of a spring (Fig. 56). 

Elasticity of compression and extension is a property of liquid 
and gaseous bodies, as well as of rigid bodies. Hence, these dis- 
turbances may be transmitted in all bodies. However, the disturb- 
ances produced by shearing, torsional and bending deformations 
may be transmitted only by rigid bodies possessing the corresponding 
elasticity. 
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For compression and extension, the motion of the particles is 
in the direction in which the mechanical action is transmitted. 
Hence, the propagation of the disturbance is said to be longitudinal. 
In the case of shearing, torsion and bending, the direction of motion 
of the particles may form, generally speaking, any arbitrary angle 
with the direction in which the energy is transmitted. 

We can always select the direction in which the mechanical 
action is being transmitted and then resolve the displacement of the 
particles of the body into three mutually perpendicular components, 
whereby one of the components is in the direction of propagation 
and the other two are in a perpendicular plane. Thus, in the most 
complex case, we can consider the propagation of a disturbance as 
consisting of the sum of three motions—one longitudinal and two 
lateral. 

The velocity of propagation of the disturbance due to an elastic 
deformation depends on the mechanical properties of the body. 
As is shown in theoretical physics, this velocity is related to other 
physical constants of the body. Thus, for longitudinal waves, it 
is given by the simple formula: 
fi 

V px 
Here, pe is the density of the body and x is the compressibility. 
A large density leads to an increase in the inertia of the body’s 
particles and, consequently, the velocity of propagation of the 
elastic waves decreases. The small values for compressibility indi- 
cate that even large elastic forces correspond to only small deforma- 
tions. The smaller the compressibility, the greater the velocity of 
propagation of the disturbance. 

This is the form in which this formula is generally used for liquids. 
Thus, water compresses by 5 X 10-5 of its volume for a change in 
pressure of 1 atm. This means that the compressibility, equal by 
definition to 

1 Av F _g cm? a 
X= aay Aa ah p. 157), 1S {0-5 dyne x 5) x 40-5, 
The density of water is 1 gm/cm’. Hence, for the velocity of prop- 
agation in water, we obtain , 


c? =2 x 10! cm?/sec?, 
c=1,400 m/sec. 


c 


For gases, it is convenient to convert the formula for velocity 
jnto another form. Since the process of transmitting compression 
jn a gas is very rapid, the compression and expansion of a gas may 
pe considered to occur adiabatically, i.e., without heat exchange. 
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We shall derive the equation of the adiabatic process below (p. 170), 
from which it is easy to obtain the following relationship between 
the coefficient of compressibility and the pressure of the gas: x = 
= oF , where y + 1.4*. Thus, c= Vz .For an ideal gas, the density 

a 


Vv 


is proportional to the fraction PE , where w is the mass of 


= T 


a ae v 
a mole of gas and v is its volume. This is so because F = const. 


Hence the velocity of propagation in a gas is 


Vor 

C= a—. 

lt 

Here, a is a constant whose value is easily calculated by means 
of equations considered below (p. 170). 

Thus, the velocity of propagation of the disturbance due to a defor- 
mation in a gas, including the velocity of propagation of sound 
waves, which will be discussed in more detail later on, is proportion- 
al to the square root of the temperature and does not depend on 
the pressure of the gas. The dependence on the molecular weight is 
interesting. In hydrogen, the velocity of propagation is equal to 
1,263 m/sec, while in air, as is well known, it is 331 m/sec. 

For longitudinal waves propagating in a rigid body, the coeffi- 
cient of compressibility is usually replaced by the modulus of 
elasticity. Since, by definition, the modulus of elasticity 


ra+..+! , AL 


ee A aay ee, 


it is evident that in the absence of transverse motion “ae, for 


the linear compression is equivalent to the volume compression. 
The formula for velocity may then be written as follows: 


Ve 
c= _. 
p 


* The equation of the adiabatic process is pv’ = const. If p and v are the 
equilibrium values of the pressure and volume for a certain mass of gas, and 


pt Ap and v — Av the corresponding values at the instant of deformation, 
en 


(p+ Ap) (v—Av)* = pv’, 


: = = 2 
Whence, 1-42 = (4 —*) Yst—y At -1h—) (=) 4... Dis- 
regarding terms of higher order in the binomial expression, we obtain 


Ap=—yp oe . Hence, n= F 
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The table shows the extent of the agreement between calculated 
and experimental values: 


: F ‘5 Density alc. xp’ 

Young's Modulus FE ST asec” Ceres 
Glas§: 4.6 4 222 New ahs 7.8105 kg/cm? 2.4 5,700 5,990 
Steel .......04., 2,200 kg/mm? 8 5,200 5,000 
Wood ......26- 7.2x 105 kg/cm? 0.7 4,130 4,200 
Water (138°C) ..... %=4.75x 1075 atm71 1 1,450 1.440 


To check the formula for the velocity of propagation of sound, it 
is necessary to use samples in the shape of slim rods. This is due 
to the fact that a more thorough analysis of the problem indicates 


aa 
that the formula c= V a is only valid for such bodies. For bodies 


having other shapes, as well as for the propagation of sound in 
a continuous medium, theory leads to expressions which we shall 
not introduce. 

It should also be noted that the values given in the table are 
only for guiding purposes. The velocity of sound in different types 
of glass, wood, steel, etc., differ considerably. 


33. Generation of Wave Motion 


Sustained vibrations may be applied to a particular point of 
a body or medium in a variety of ways. A force acting periodically 
at some point of a body produces a periodically varying deforma- 
tion whose disturbance is transmitted at a specific velocity from 
one point of the body to another. All the particles of the body 
participate in the vibratory motion. Since the velocity of prop- 
agation is finite, however, the particles of the body are set into 
vibration in consecutive order. If a body is infinitely large, such 
a vibration will advance continuously, forming a travelling wave. 

Infinitely large bodies do not exist. However, the actual length 
of a large body does not affect the nature of the phenomenon, for 
the vibrations do not reach the end in view of inevitable ener- 
gy losses. 

Let us consider a wave travelling in a particular direction in 
a body that for all practical purposes is infinitely large. Assume 
that the particle located at the origin of the coordinate system is 
vibrating in accordance with the equation y = A cos ot. Let us 
write the equation of vibration for a particle located along the 
line of propagation of the disturbance at a distance x from the ori- 


8—1409 
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gin. It is not the same as for the particle at the origin because this 
particleSbegan to vibrate after a delay of t=, the time required 
for the disturbance to be propagated the distance z. The vibration 
of particle x, therefore, is shifted in phase with respect to the vibra- 
tion of the particle at the origin. At the instant of time ¢, the vibra- 


tion of particle x will have the same phase as the vibration of the 
particle at the origin at an instant of time = earlier. Hence, the 


equation of vibration for a particle displaced by a distance x from 
the origin is 


y= Acoso (t—=) : 


where a. is the phase shift. 


The above equation is known as the wave equation. It is valid 
for the vibration of any particle located at any distance from the 
origin. 

Let us assume that the source of the wave is far from the observ- 
er and that the wave front has long since moved ahead. We now 


—— > Motion of the wave Motion of the wave<— 


Fig. 57 


consider a portion of the line along the z-axis subject to wave 
motion. At first glance, the introduction of a new concept may appear 
unjustified. To be sure, vibrations occur at all points in this region. 
However, can we discern how the wave is moving? Is it moving 
to theright or to the left? Careful observation shows that the specific 
character of the wave motion is easily detected. If the wave is 
moving from left to right, the adjacent point on the right will be 
delayed in phase with respect to the point on the left. In the reverse 
case, the point on the right will be ahead in phase. Fig. 57 shows 
waves travelling to the right and to the left. Each sinusoid repre- 
sents a snapshot of the wave at a particular instant. At each suc- 
ceeding instant, this sinusoid is displaced in its entirety in the 
direction in which the energy is being transmitted. 


34. Pressure and Velocity of Vibrations : 115 


It is clear from this that thef the wav direction oe motion affects 
the form of the wave equation. If the wave moves along the coordi- 
nale axis in the posilive direclion, a minus sign must precede the 
value of the coordinate x. If the wave moves along the coordinate 
axis in the negative direction, the sign in the argument of the 
cosine must be reversed: 


xz zx 
y= Acoso (t—=) ; y= Acoso (¢+—) : 
pos. direct. neg. direct. 


The wave equation at an instant of time equal to a multiple of 
the period reduces to 
= ee on — 
y = Acos o— A cos 21 = : 
The minus sign is dropped since the cosine is an even function. 
From this equation, it immediately follows that the period of the 
sinusoid is 
AN=cY. 


This spatial period, i.e., the distance covered by an undulation 
before it repeats itself, is known as a wavelength. We have thus 
arrived at the well-known relation connecting the velocity of a wave, 
ils length, and the period of a vibrating particle. 

A number of physical quantities vary sinusoidally in the undula- 
tory transmission of a disturbance through a body, namely, the 
displacement of a particle from its equilibrium position, the velocity 
of the vibrating particles, the pressure and the density. Hence, the 
above wave equation is very general. The quantity y may designate 
any of the enumerated physical quantities, which vary sinusoidally 
when the wave moves in the z-direction. It should be noted, of 
course, that the waves of pressure, velocity and displacement do not 
necessarily have to coincide in phase. For example, it is clear that 
the wave representing the velocity of the vibrating particles is shift- 
ed in phase by 90° with respect to the wave of the displacements, 
for the velocity of a particle is a maximum when it passes through 
the equilibrium position. 


34. Pressure and Velocity of Vibrations 


It is interesting to examine the relationships between the wave 
amplitudes of various physical quantities. For this purpose, we 
shall only consider longitudinal waves propagating in a gas and 
concern ourselves with waves of displacement, particle velocity and 
incremental pressure. Since the theory arose in connection with 
auditory waves, the incremental pressure Ap is often called the 


8* 
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sonic pressure and is designated by p, the symbol A being dropped. 

If A is the amplitude of the displacement wave, then oA is the 
amplitude of the velocity wave. These two waves are 90° out of 
phase. 

We shall now derive the relationship between the amplitude of 
the velocity of vibrations and the amplitude of the pressure. From 
the general definition of x as applied to gases (p. 112), we obtain for 
sonic pressure the formula 


where P is the pressure of the gas. time the relation c? =", we 
obtain 
6: AD 
sae eT 


It is perfectly natural that a direct connection should exist between 
the incremental pressure p and the relative compression in the gas 
at the same location. 

However, going a step further, the relative compression of the 


volume, “ , can be related to the displacement amplitude of the 


vibrating particles. Let us mark two points, x, and ze, along the 
line of propagation. For a longitudinal wave, changes in density 
occur as a result of displacements in the direction of propagation. 
Let us considera volume of gas bounded by the cross-sections through 
x, and x». When the wave moves, the molecules within this volume 
are displaced. However, it is only necessary to consider the situation 
at the limiting cross-sections. If the molecules of the layer through 2; 


are displaced by y; = A cosa (¢ — =1) and the molecules of the 


layer through x2 by yz = A cos w (¢ — 2), then the linear dimen- 
sion of the volume, xz, — z;, changes by the amount . ve — a The 
relative change in length, and hence in volume, is 4 . Going 


over to the limit, in order to obtain a quantity deseeiptise of a point 
in space, we obtain 


and for the pressure 


p= cp Aosin o (c=) ; 
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This shows that the pressure changes in phase at the rate of the par- 
ticle vibrations in the wave. Aw = Up is the amplitude of the veloc- 
ity of vibration. Thus. po, the amplitude of the pressure, is related 
to uo, the amplitude of the velocity, as follows: 

ay 


Po = PCy. 


In acoustics, uw is usually measured in cm/sec and p in dynes/cm’*. 
Using these units, we obtain po = 41uy for air at room temperature. 
The quantity pc is called the acoustic or wave resistance. The 
meaning of the designation is evident, namely, the greater the resis- 
tance the smaller the velocity of the vibrating particles for the same 
values of incremental pressure. 

The acoustic resistance of several materials is given in the table: 


| po c pc 

| (gm/e:n3) (cm/sec) Cee niteae ) 
Glass , 2.6 5.5x 105 14x 105 
Steel ..... 7.9 5x 105 40x 105 
Wood 0.7 4.2105 2.9 105 
Water 1 1.44 105 1.4 105 


35. Energy Flux 


Wave motion transfers energy from one location in space to anoth- 
er. However, it should be kept in mind that every particle of the 
medium is involved in the transmission of energy and each con- 
tinuously vibrates about an invariable equilibrium position. 

Since all the particles of a body are involved in the vibration, 
the vibrational energy of a unit volume is 


O’max 
w= max : 
2 


where p is the density, ise., the mass of a unit volume, and vy» qx is 
the amplitude of the velocity of vibration. Substituting for the 
-latler quantity the familiar expression 


Umax = oA ’ 


where A is the displacement amplitude and w is the frequency, we 
can write the density of the vibrational energy of a body in the form 


w2A2 
Hy cee 
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This energy propagates with a velocity c. The following question 
now arises: What is the expression for the wave intensity, i.e., the 
amount of energy passing in unit time through a unil area perpen- 
dicular to the direction of propagation of the wave? Instead of refer- 
ring to the intensity of a wave, however, it is more usual to speak 
about the vibrational energy flux, meaning thereby the energy 
passing per unit time (power) through a given area. This approach 
is completely analogous to the analysis of the flow of water in a pipe. 
In a unit time, a wave traverses a path c and transmits the energy 
contained in a cylinder of length c¢ and unit cross-section. Since 
a unit volume contains the energy w, the energy in the above cylin- 
der is wc. This is precisely the meaning of wave intensity: 


I=we. 


We see that wave intensity has the sense of energy flow through 
a unit area. This was first noted by. N. A. Umov in his theoretical 
work on energy motion in bodies. 

Until now, it has been assumed that the wave motion propagates 
along a straight line. Such an assumption is useful when investigat- 
ing disturbances travelling along rods, strings, air columns, etc. 
However, we are also interested in investigating cases involving 
three-dimensional wave motion. 

To describe a three-dimensional wave, we must know how the 
wave front moves. The wave front at a particular instant can be 
determined if all the points in space having the same phase of vibra- 
tion are given. Noting the successive positions of this conslant-phase 
surface, i.e., the wave front, we obtain a clear picture of the nature 
of the wave motion. 

Generally speaking, this surface may have any shape. In this 
case, then, what is meant by the direction of the wave propagation? 
It is natural to understand this direction to mean the normal to the 
wave front. 

If the medium is perfectly homogeneous and the wave emanates 
from some pvint in the medium, the wave front is spherical. Such 
a wave propagates along the radii from the centre. At large distances 
from the centre of radiation, large portions of the wave front will 
appear to be in a plane—within experimental accuracy. In this man- 
ner, the concept arises of a plane wave propagating in a direction 
normal to the wave front. If the wave radiator is of linear shape, 
a cylindrical wave propagating along the radii of the cylinder is 
generated. Various types of waves are shown in Fig. 58. 

Disregarding all energy losses that occur during the motion of 
a plane wave, we obtain that the quantity of energy passing through 
successive positions of the constant-phase surface remains unchanged. 
Hence, the intensity of a plane wave will not change during 
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the process of propagation. The situation is different, however, 
with regard to spherical and cylindrical waves. Since the constant- 
phase surfaces increase in area as the square of the distance for 
spherical waves, and linearly as the distance for cylindrical 
waves, the intensities of these waves are inversely proportional to 


the square of the distance and to the distance, respectively. Only 
in this manner can the law of conservation of energy be satisfied. 

The intensity of a wave is proportional to the density of the vibra- 
tional energy, which is proportional to the square of the amplitude. 
Hence, the amplitude of a spherical wave is inversely proportional 
to the distance from the radiating centre, and the amplitude of 
a cylindrical wave is inversely proportional to the square root of 


the distance from the linear radiator. Thus, y=4 cos @ (« — =) 


: 1 r 3; : 
for a spherical wave and y=—e cos @ (t — =) for a cylindrical 
wave. Here, the distance r, just as previously x, is measured along 
the direction of wave propagation. 


Let us assume that a source of vibrations having a frequency of 1 ke/s is 
placed under water. The energy flux J = 1 watt/cm?. Let us calculate the 
displacement amplitude A of the water molecules, their acceleration B and 
the amplitude A = uo of the vibrational velocity. 

From the formulas of previous articles, it follows that 


pk V 2% xs07 em; B-==oH 2! y 497 cm : 
pe pe 
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-,~ mM m 
For water, ¢ = 1,450 —— and p= 1 B — . Hence, 
sec cm3 
cm 


Avx1.9x10-3 cm; B74 : 
sec 


ran Ug © 12 —— 


For the same energy flux and frequency of vibration, we obtain the follow- 


ing results in air, where c = 330 = and p = 1.293 x 1073: 


A=0.11 cm; B=43 x 105 — a= 43, 


cm 
ip 100 
sec 


36. Damping of Elastic Waves 


In actuality, waves propagating in a medium (solid, liquid or 
gas) decrease in intensity considerably more rapidly than indicated 
by the inverse-square law. This is due to losses in mechanical energy, 
i.e., transformation of mechanical energy into heat. 

The relation expressing the decrease in intensity of some particu- 
lar radiation in passing through a medium may almost always be 
obtained by reasoning as follows (for any medium and any radia- 
tion): If a wave passes through a layer of thickness dz, the intensity 
loss should be proportional to the intensity of the incident wave 
and the thickness of the layer, i.e., dJ = — pldxz. 

This equation may be integrated. Assuming that the intensity 
is equal to J) at the point z = O and to J at the point x, we obtain 
a relation that is valid for finite distances: 


T x 
\ aa ‘ dz, i.e., T= Ipe-*. 
To 0 


Thus, the intensity of the wave decreases exponentially. 

In acoustics, it is convenient to use the concept of amplitude 
damping. Since the intensity is proportional to the square of the 
amplitude, the amplitude damping is expressed by a relation that 
differs from the above only in that the coefficient of damping (or 
absorption) is one-half of the value given there: 


1 
A=Ae 2", 
Let us examine the absorption coefficient p (ors 1) somewhat 


closer. It is measured in reciprocal centimetres (for the exponent 
must be a dimensionless quantity) and is equal to the reciprocal 
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of the thickness for which the intensity or amplitude of the radia- 
tion decreases by the factor e. 

Naturally, the exponential damping relation is only a partial 
solution to the problem of the absorption of elastic waves by a medi- 
um. The determination of the dependence of the absorption coeffi- 
cient on the properties of the medium and the radiation frequency 
is a more formidable aspect of the problem. 

It has been found that for many materials the damping of an 
elastic wave (most of the available data being for sound waves in 
air) increases with the frequency of the vibration. The relation for 
the absorption coefficient has been determined to be 


=a". 


For air, a = 4 X 10- sec*/em. Thus, over a distance of 1 km, 
a plane wave having a frequency of 100 cps decreases by a factor 
of ~1.015, while a very high audio frequency of 20,000 cps decreases 
by a factor of 1074! Ultrasonic vibrations are damped so rapidly 
that their transmission over a distance of more than several hundred 
metres is completely impractical. 

However, the monotonic relationship between absorption and 
frequency is not always satisfied. Some materials exhibit selective 
absorption of sound in a relatively narrow frequency band. Thus, 
the absorption of ultrasonic waves by carbon dioxide is a maximum 
at frequencies near 277 ke/s. The parabola calculated in accordance 
with the formula » = aw? closely matches the experimental data 
in all regions except in the band indicated above. At frequencies 
close to 277 kc/s, the absorption is about 20 times greater than that 
calculated assuming a parabolic relationship. 

The dependence of the absorption coefficient on the properties of 
the medium can be expressed as follows for longitudinal waves in 
gases and liquids: the absorption coefficient is inversely proportional 
lo the cube of the velocity of the elastic wave and directly propor- 
tional to the kinematic viscosity. As a result of this strong depend- 
ence on the velocity of propagation, and the fact that the kinematic 
viscosity of air is large, the absorption of sonic and ultrasonic waves 
in a liquid is about 1/1,000 of that in air. This means that for the 
same frequency elastic waves will propagate 1,000 times further 
in water than in air. 

The absorption of transverse waves in solid bodies is also strongly 
dependent on the properties of the body. Thus, the absorption in 
rubber, cork and glass is, respectively, 13,000, 8,500 and 130 times 
greater than in aluminium. 

Due to their complexity, we shall not go into the theories of elastic 
Wave absorption in bodies. 
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37. Interference of Waves 


If there are several sources of waves instead of just one, then each 
point of the medium is simultaneously subject lo several wave 
motions. It turns out that it is always possible to consider the vibra- 
tion of a physical quantity due to the action of several waves as the 
sum of the vibrations that would occur if each wave acted independ- 
ently. 

Let us assume that spherical waves emanate from two points 
located at a certain distance from each other. By means of the wave 
equation, we can determine the value of the vibration amplitude 
at any instant of time for any neighbouring point. If the point in 
question is located at a distance 7; from the first source of waves 
_ and at a distance rz from the second source, the vibration there is 
represented by the formula 

y = Acos 2x (w—4) + Acos 21 (w—) . 
The result obtained by adding two vibrations differing only in phase 
is, as we know, also a harmonic vibration whose amplitude is 


2A cos> , which can be seen to depend on the phase difference be- 


tween the component vibrations. The phase difference 6 is equal 
in this case to 


Qn (45°. ) : 


Thus, generally speaking, all points of the wave field under con- 
sideration will be in vibration. However, the amplitudes of these 
vibrations will be different at different points. Two extreme cases 
deserve attention. First, let us consider the points at which the 
component vibrations annul each other. These points satisfy the 
condition’ 


2n 1. = (2k +1) a, 
where k = 0, 1, 2, 3, ..., i.e., the phase difference is equal to an 
odd multiple of m. On the other hand, if 


dn t= "2 _ den, 


i.e., if the phase difference is equal to an even multiple of x, the 
amplitudes of the vibrations will add arithmetically. Thus, in this 
case, the amplitudes reinforce each other to a maximum degree. 

The difference r; — rz may be called the path difference of the 
waves and this term needs no further explanation. The conditions 
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of maximum and minimum amplitudes may be formulated some- 
what differently by means of this concept. The maximum condition 

ry—le= kn 
slates that the path difference between waves arriving at a given 
point must be equal to an integral number of wavelengths. The 
minimum condition 


nS 
ry—lg= > (2k +1) 
stales that the path difference must be equal to an odd number of 


half-wavelengths. These conditions are very easily visualised as 
follows: The waves rein- 


force each other when one a=3a 
crest is superimposed on 7 a=5f2a 
another, and annul each vo  d=2a 
other when a crest is su- a=3/2a 
perimposed on a_ trough d-a 
or node. _—— d= l/2a 
The superposition of 7 ume d=0 
waves, i.e., the addition 2======2—-——_~ --+-+ a=I/2A 
of their amplitudes, leads = | 
to interference. ~~ a=-3/24 
From analytic geometry d=-2A 
we know that a_hyper- Nghe 
bola is a curved line sat- BA 


isfying the condition that 
the difference of the dis- Fig. 59 
tances from any point on 
the curve to two foci is a constant. If we pass a plane through 
the point sources and note in the diagram points of maximum 
reinforcement and those of wave annulment, the points will fall on 
hyperbolas. The corresponding curves are shown in Fig. 59. Such 
a picture can be easily observed on the surface of water when two 
sources sending out ripples from neighbouring points set up an 
interference pattern. 

We can use the above method to analyse the interference of any 
number of wave sources. 


38. Principle of Tuygens-Fresnel. 
Reflection and Refraction of Waves 


The complete equality of all the vibrating points of a wave field 
is striking. They differ only with respect to phase. As a result, it 
is natural for the following idea to emerge: It should be possible 
to consider any point of a wave field as an independent source of 
spherical waves. 
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” The validity of this idea first formulated in 1690 by Christian 
Huygens, may be tested by attempting to construct a wave front 
from data on a wave field on a boundary surface. It is necessary lo 
take into account that there will be interference between the indi- 
vidual spherical waves (also called elementary waves or wavelets). 
Huygens’ principle, supplemented by Fresnel, shows that such 
a construction is possible. 

What is the significance of this principle? Let us assume that the 
wave falls on an opaque screen having several apertures. By means 
of the principle of Huygens-Fresnel, we can map the wave field 
beyond the screen without knowing anything about the sources of 


Fig. 60 


the field. It is sufficient to know the intensity of the field in the 
plane of the screen and assume that a spherical wave propagates 
from each point on the screen. The amplitude of the wave at any 
location in space is determined by adding all the wavelets coming 
from the apertures in the screen. . 

Postponing consideration of the problems related to the passage 
of waves through a screen (problems which are mainly of interest 
in connection with light waves), we shall now apply the principle 
of Huygens-Fresnel to the explanation of the phenomena of wave 
reflection and refraction. 

Let us consider a portion of a plane wave incident on the bounda- 
ry between two media. As is well known, a wave of any origin is 
reflected at an angle equal to the angle of incidence. But why should 
this occur? Huygens’ principle gives the explanation. Every point 
on the boundary between the media may be considered as a wavelet 
source. The first wavelet emanates from the point first reached by 
the incident wave. Successive points on the boundary will 
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then be excited, the last point to start vibrating being the one 
last reached by the incident wave. Fig. 60 shows the positions 
of the wavelets for the instant of time when the incident 
wave reaches the last point. The wave front generated by the wave- 
lets forms an angle with 
the boundary equal to that 
of the incident wave. 
Thus, the propagation ve- 
locities of the incident 
wave and the reflected 
wave are the same. This 
means that the radius of 
the largest sphere must be 
equal to the path trav- 
ersed by the incident wave 
from the instant the first 
point was excited to the 
instant the last point was 
excited. 

In exactly the same 
manner, the wave front of 
areflected spherical wave can be easily constructed. This construc- 
tion is shown in Fig. 61. In Fig. 62, a photograph is shown of 
a sound wave reflected by a 
wall. 

Let us now consider wave- 
lets penetrating into the sec- 
ond medium and generating 
arefracted wave front (Fig. 63). 
The different media have dif- 
ferent densities and elastic 
j / properties. Hlence, the wave 
} | propagation velocity also dif- 
\ fA fers for each medium. Let us 
el nthe now ‘perform the same con- 
EAS 4 struction as for reflection, i.e., 

draw wavelets on the diagram 

Fig. 62 for the instant when the inci- 

dent wave reaches the last 

point. The wave front is rotated due to the difference in propagation 
velocities. If the wave has penetrated into a denser medium, the 
radius of the largest wavelet should be less than the path traversed 
by the incident wave from the instant of excitation of the first point 
on the boundary to the instant of excitation of the last point. More- 
over, the ratio of these lengths should be equal to the ratio of the 
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wave propagation velocities. On the other hand, as can be seen from 
Fig. 63, the ratio of the above distances is equal to the ratio of the 
sine of the incident angle and the sine of the refraction angle. Thus, 
we arrive at the well-known relation for wave refraction: 


The direction of the propagation will be deflected toward the normal 
to the boundary if the wave moves into a medium of greater density 
and, on the other hand, if it 
moves into a medium of lesser 
density, it will be deflected 
away from the normal. The 


é c . 
ratio =n is known as the 
2 


refraclive index. 


39. Reflection Coefficient 


The explanation of the ge- 
ometry of reflection and refrac- 
Fig. 63 tion may appear to be a 
somewhat uninteresting appli- 
cation of the theory. However, the wave theory enables us to do 
considerably more, namely, ascertain the relative proportions of the 
reflected and refracted waves as functions of the properties of the 
media whose interface is being considered. In order to simplify the 
calculations, we shall limit ourselves to the simple case of the 
normal incidence of a longitudinal wave on the interface of two 
media. The nature of the proof, however, is the same for all 
conceivable cases. 

In a discussion of this type, the following is axiomatic. At the 
boundary between two media, neither w; the velocity of the vibra- 
tions of particles, nor p, the incremental pressure, can change abrupt- 
ly. It is intuitively evident that it cannot be otherwise, but this 
can be shown rigorously on the basis of fundamental laws of physics. 

On one side of the boundary, there are waves having the instan- 
taneous velocities Ujncig and U;esiecy. On the other side of the bound- 
ary, thereis also the wave having the instantaneous velocity Uyefract- 
The continuity of velocity yields the condition: Ujncig + Urefiect = 
= Urefract; the continuity of pressure yields: Ujncia P1C1 + Urefiect P1C1= 
= Urefract P2C2. However, examining these two equations, we see 
that they are incompatible since pyc; ~ Poco. How is this to be 
explained? The answer is that we have forgotten that the instanta- 
neous values of the velocities and pressures are vector quantities 


389. Reflection Coefficient 127 


and that even in the simple case when the displacement vectors are: 
in a single plane the amplitudes may differ in sign. It can be seen 
that the equations become compatible only if the amplitudes of the 
velocity and pressure for the reflected waves are given opposite 
signs. The equations of continuity are then written in the form 


Wincid + Ureflect = Urejracts (Uincid —Ureslect) P1C1 = Ure fractP2C2 
or 
Wincid — Ureflect = Urefracts (Uincia + Ure flect) 01Cy = Ure fractP2Co- 


We leave it to the reader to show that all other combinations of 
signs will fail to make the equations compatible. 

Since the amplitudes are positive quantities, the sum must be 
greater than the difference. Hence, the first pair of equations is 
valid when p,c; > Pece and the second pair is valid for the reverse 
case. The first pair of equations arises when all the vibration veloc- 
ity amplitudes are in one direction and the phase of the reflected 
pressure wave differs by 180°, i.e., the amplitude of the reflected 
wave is oppositely directed with respect to the incident and refract- 
ed waves. The second pair corresponds to the reverse case. 


P1C1 > Pace £101 < pace 
Velocity wave Pressure wave Velocity wave Pressure wave 
incident > incident > incident incident —> 
reflected > reflected <— reflected <— reflected > 
refracted refracted > refracted > refracted > 


This interesting phenomenon of amplitude vector reversal in 
reflection may be described as a one-half wavelength loss or a 180° 
phase jump. Thus, the change of sign in the wave equation 


y = Acos w (¢ — =), where y is any physical quantity, may be 


obtained by introducing a 180° phase shift in the argument of the 
cosine. On the other hand, a 180° shift in phase is equivalent to. 
displacing the wave distribution by one-half wavelength. 

Thus, at the interface of two media, the incident and reflected 
waves act either to reinforce each other or to annul each other to the: 
maximum extent possible. ' 

It should be recalled that in reflection a one-half wavelength loss: 
occurs for the vibration velocity wave when it enters a medium of 
greater resistance (sometimes inaccurately expressed as a medium 
of greater density). The displacement wave is inseparably linked 


128 Travelling Waves 


with the vibration velocity wave and also suffers a one-half wave- 
length loss. 

In entering the second medium the wave does not execute a phase 
jump. 

Solving the above equations simultaneously, we obtain the expres- 


F A j : Ureflect 
sion for the reflection coefficient r, i.e., — ~~~ 


Thus 


. ——. 
“incident 


P1io1 — Pace 
Pyey-+Pp2ce ” 
where r isalways > 0. Similarly, the refractive index g, i.e., 
18 _ __ 2pyey 
ea Prey + Pte © 

The wave resistance in air is much different from that in solid 
bodies. As indicated above, pc = 41 for air, while for steel (0 = 
= 7.9 gm/em*® and c = 5,000 m/sec) pe = 40 x 10°. Thus, r = 
= 0.99999. This means that sound incident from air on steel is 
practically completely reflected and in effect does not penetrate 
the latter. It can be easily calculated that at the boundary between 
air and water r = 0.9997. 


40. The Doppler Effect 


Until now it has been assumed that the wave source and the receiv- 
er (i.e., the observer) were both stationary with respect to the 
medium in which the wave was propagating. Various effects, which 
were first noted by Doppler (1842), occur when the source or the 
observer, or of course both, move with respect to the medium. They 
consist basically in the fact that when the wave source moves the 
observer measures the vibration frequency v’ and when the observer 
moves he measures the vibration frequency v”. These frequencies 
dilfer from each other and from the frequency v that is measured 
when the observer and the source are stationary. 

In considering the Doppler effect, it is necessary in the first 
place to note that the wave leaving the source propagates entirely 
independently of the motion of the source and the observer. There- 
fore, in moving relative to the medium, the source or the observer 
may approach or recede from the moving wave. 

Why does such motion lead to the measurement of a frequency 
that differs from its “real” value? This is because the observer deter- 
mines the vibration frequency as the number of waves entering his 


apparatus per unit time. On the other hand, the formula vey 


r= 
Ure fract 


ir nat ae ? 
Uincident 


gives the number of waves emitted per unit time. If the observer 
moves toward the source with the velocity u, then in 1 sec the num- 
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ber of waves Lhat he measures is not v, but a number larger than 
this. Moreover, the ratio of c -- u, the relative velocity of the wave 
and the observer, to c is the factor by which the measured value is 


larger. Thus, 

—_ ¢ = ; v=v(l +4) F 
If the source moves toward the receiver, the observer will again 
register a larger number of waves than when the source and the 
receiver are stalionary. [Tlowever, in this case, the reason for the 
increase is different. 

This is not evident at first glance. The motion of a source having 
a fixed frequency of vibration leads to a change in the distances 
between points of equal phase of the wave. If the first case is con- 
sidered to be crudely analogous to Lhe motion of an observer toward 
a column of athletes running at equal velocities and maintaining 
the constant distance 4 between them, then the second case clearly 
requires a different interpretation. This case can be visualised as the 
slow displacement of the line of start. At equal intervals of time, 
the runners jump from an automobile moving along the track, which 
leads to a change in the distances between them. This distance 
becomes A’, not A’. If the line of start (the source) is displaced in 
the direction of the observer and v runners start each second, then 
in 1 sec they are distributed over a distance given by c — u. Thus, 


. . C—l 
the interval between runners (wavelengths) is Ma, The fre- 


quency at which the runners moving with velocity ¢ cross the finish 
line, i.e., the vibration frequency perceived by the observer, is 


” 


Cc 
v = Fr Vv =v 


Both of the above formulas are also valid when the source and the 
observer are moving apart. In this case, it is merely necessary to 
reverse the sign of wu. 

Thus, it has been shown that when the source and the observer 
approach each other the measured frequency of vibrations radiated 
by the source increases. When the source and the observer move 
apart the frequency decreases. 

A well-known example of the Doppler effect is the change in sound 
of the whistle of a locomotive as il passes an observer. When the 
locomotive approaches the observer the frequency of the sound is 
higher than the real frequency. The pitch changes abruptly when the 
locomotive sweeps past him. In receding, the frequency of the sound 
perceived is lower than the real frequency. Tor a train moving at 
a velocity of 70 km/hr the jump amounts to ~12 per cent of the 
real frequency. 
9~1409 


CHAPTER VII 
STANDING WAVES 


41. Superposition of Two Waves Travelling 
in Opposite Directions 


Let us assume that two plane waves having exactly the same 
characteristics are moving in opposite directions. We are interested 
in the resultant vibrational motion of the medium in which the 
waves are propagating. 

As indicated above, a difference in the direction of propagation 
is taken into account by a difference in the coordinate signs in the 
wave equation. The resultant displacement should, therefore, be 
given by the expression 


y = Acoso (2 —=) + Acoso (2 +=) = 2A cos cos Of = 


2x 


= 2A cos ‘ 


COS of. 


This result is very interesting, for the sum of two travelling waves 
has not yielded wave motion. The formula obtained indicates the 
presence of vibrations of amplilude 2A cos = , Whose numerical 
value depends on the location in space. We call this peculiar vibra- 
tional state of the medium a standing wave, which arises whenever 
two identical travelling waves move in opposite directions. It should 
be emphasised that a standing wave is not a wave in the usual 
sense. A travelling wave transfers cnergy from one point to another, 
but this is in no way true of a standing wave. A travelling wave 
can move to the right or to the left, but a standing wave has no 
direction of propagation. The adopted designation merely character- 
ises the vibrational state of the medium. 

What are the characteristic features of this vibrational state? 
In the first place, we see that not all the particles of the medium 


vibrate. At the points in space satisfying the condition rat, 


o ’ = , .. +, the vibration amplitude is equal to zero. These points 
are known as the nodes of the standing wave. The distance between 
fwo adjacent nodes along the z-axis, the direction of propagation 
of the travelling waves, is equal to one-half wavelength. Between 


every Lwo nodes is a point that vibrates with a maximum amplitude 
of 24. Such points are called the antinodes of the standing wave. 

Fig. 64 shows the vibrational state corresponding to the standing 
wave at several successive instants of time. We see that the adopted 
designation is fully justified. At cach instant a wave can be seen, 
but the wave does not move. A series of consecutive snapshots will 
show that the points of intersection of the wave with the abscissa, 
i.e., the nodes, remain fixed. The 
wave is stationary and the only 
change occurring between snapshots 
is a change in the magnitude of 
the displacements. At a certain in- 
stant, all points of the medium are 
motionless. After this instant of 
time, points that previously diverged 
upwards will now diverge down- 
wards, and vice versa. It is evident 
that this picture has nothing in com- 
mon with the travelling wave shown 
in Fig. 57, where two comparable 
“snapshots” are depicted. There the 
wave is seen to move. At each suc- 
cessive instant, the maxima and min- 
ima of the wave pass to new loca- 
tions. 

We stated that no energy transfer 
occurs in astanding wave. Then how 
can we describe, in terms of energy, 
the processes occurring in this pe- 
culiar vibrational motion? Clearly, 
the energy of a standing wave (for : 
some region in which it exists) is Big Gi 
a constant quantity. At the instant 
when all the particles are passing through the equilibrium, position, 
ail the energy of the vibrating particles is kinetic. On the other 
hand, at the position of maximum deviation of the particles from 
the equilibrium position, the energy of all the particles of the body is 
potential. 

A standing wave is a very important vibrational process. Stand- 
ing waves of different types arise in bodies of limited dimensions 
through which elastic waves are propagated. This is because elastic 
waves are reflected back from the boundary into the body. A com- 
plex vibratory state arises in the finite body which is the result of 
the superposition on the source wave of all the waves that were 
reflected from the walls. Several typical cases will now be considered. 


g* 
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42. Free Vibrations of a Rod 


By means of a blow or otiier means, it is possible in any solid 
rod -to excite a longitudinal wave that propagates along the length 
of the rod. From the opposite end of the rod, this wave is reflected 
and in this manner the entire rod is put into a vibratory state re- 
presented by astanding wave. The state will be one of free vibrations 
since it arises as the result of an impulse of short duration and con- 
tinues without the action of external forces. We can predict the 
behaviour of these free vibrations if we know the length of the rod 
and how it is fixed. These data are known as the boundary condi- 
tions. It will be found that a node of the standing wave is located 
at the point where the rod is fixed and an antinode of the standing 
wave is located at the free end. 

We shall now consider several modes of excitation of free, longi- 
tudinal vibrations in a rod of length ZL. 

A Rod Fixed at Both Ends. In this case, nodes of the displacement 
wave are formed at the ends of the rod. Since the distance between 
nodes is equal to one-half wavelength, the wavelengths that are 
possible, in terms of the length of the rod, are given by the condition 


he ao i.e., hn = ee, where n is any integral number. 


Using for the velocity of an élastic wave the expression c = Ve : 


and recalling the relationship between frequency and wavelength, 
we oblain the expression for the natural frequencies of the free 
longitudinal vibrations of the rod: 


aay = 
Be 2G o- 

The qualitatively new content of this result should be noted. 
A solid body does not have one, but amulliplicily of natural (char- 
aclerislic) frequencies of vibration. Hence, a rod can execute 
a variety of free vibrations. It is also possible for a rod to perform 
nonharmonic vibrations having any arbitrary spectrum * consisting 
of the frequencies v,. 

The frequency v, is the fundamental frequency of vibration of the 


rod. It corresponds to the vibratory motion for the condition L=%. 


This means that for the fundamental vibration an antinode of the 
standing wave is at the centre of the rod and there are no nodes be- 


* The word “spectrum” is used quite often in physics to denote a set of par- 
ticles having different velocities, masses, etc., or a set of waves having dilfer- 
ent wavelengths (frequencies), etc. 
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tween the ends. The vibrations of the second overtone (second har- 
monic) correspond to the condition L = %. Now, there is a node 
at the centre of the rod. If the third harmonic is excited, there will 
be two nodes between the ends of the rods, etc. 


Example. For a steel rod (p = 7.7 gm/cm3 and E = 21,000 kg/mm?) whose 
Jength is 7 metres, the fundamental frequency is vy = 365 eps. 


A Rod Free at Both Ends. If a rod is suspended by a thin string 
and then vibrations excited in it, the resultant standing wave must 
salisfy the condilion that antinodes are located at both ends of the 
rod. Just as in the previous case, the connection between the length 


of the rod and the wavelength is expressed by the relation: L = ne : 


Hence, the formula for the natural frequencies will also be the same. 

The difference between this case and the previous one is in the 
distribution of the nodes and antinodes. For the fundamental vibra- 
tion, the centre of the rod is at rest (node). If the second harmonic 
is excited, there will be an antinode at the centre; one-quarter wave- 
lengths away—nodes; and at the ends—antinodes. 

A Rod Fixed at One End. In this case, there will be a node at the 
fixed end and an antinode at the other end. For the fundamental 
vibration, the rod has a form corresponding to one-quarter of a peri- 
od of a sinusoid. Since the distance between a node and an antinode 


is equal to -, the relationship between the wavelengths and the 
length of the rod is given by the condition 


av 
L=n7, where n=1, 3, 5,. 


The natural frequencies of the vibrations of such a rod are given 
by the formula 


n E ; 
= te V Zm=t, 5 eee 


In the first two cases, the frequencies are related to each other as 
the whole numbers. Here, they are related-to each other as the odd 
numbers. 

A rod fixed at the centre will have a node at the fixed point and 
antinodes at the ends. The problem is essentially the same as above. 

The boundary conditions used in the consideration of the vibra- 
tory state of a rod are an extreme case of the boundary conditions 
for reflected waves, considered on p. 127. As was explained earlier, 
reflection from a boundary separating one medium from another 
medium of greater resistance is accompanied by a loss of one-half 
wavelength in the displacement wave. If the rod is fixed, the wave 
does not penetrate the second medium at all. In this case, the second 
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medium can be said to have an infinitely large resistance. The cvef- 
ficient of reflection is equal to unity and the reflection is accompa- 
nied by a lossof one-half wavelength. It is not difficult to see that 
this corresponds to the presence of a node at the boundary between 
the two media. The reflection of the wave from the free end of the 
rod corresponds to reflection from a medium having zero resistance. 
A reflection coefficient equal to unily and the absence of a half 
wavelength loss leads us to conclude that an antinode must exist 
at such a boundary. 

Free longitudinal vibrations may also be excited in columns of 
liquid and columns of gas. 

Free lateral vibrations are easily excited in a string under ten- 
sion. The distribution of the nodes and antinodes will naturally be 
the same as for a rod fixed at both ends. The set of frequencies is 
expressed by a formula analogous to that derived for the rod, the 
only difference being that in the expression for the velocity of the 
lateral wave it is necessary to replace & by the tension, i.e., the 
force stretching the string divided by the cross-section. 


43. Free Vibrations of Two-Dimensional 
and Three-Dimensional Systems 


In rods, strings and air columns, the constant-phase surfaces of 
equal phase consist of parallel planes. The vibratory state may be 
conceived as the result of superimposing plane waves extending 
along a single line. However, more complex cases are possible. Thus, 
we can have vibratory motion encompassing a two-dimensional 
region, e.g., plates and membranes, or encompassing a body whose 
three dimensions are of equal order of magnitude. 

The vibration of elastic and rigid diaphragms is a two-dimensional 
problem. A plate fixed at its edges will have a different mode of 
vibration than a plate fixed at a single point or not fixed at all. 
Apart from the vibration of rigid plates, vibration of stretched 
nonrigid films, e.g., rubber and soap films, is also encountered. 

In principle, the general behaviour of the freee vibrations in this 
case does nod differ from that already considered. Since this is a 
two-dimensional problem, the nodes and antinodes will now con- 
sist, in general, of curved lines. For example, the fundamental 
vibration of a circular plate fixed along ils circumference has a sin- 
gle antinode (a point in this case) at the centre of the circle, i.e., 
the central point vibrates with maximum amplitude. As we move 
toward the edge, where a nodal circumference is located, the ampli- 
tude gradually decreases while maintaining circular symmetry. This 
is the simplest case, namely, vibration of the fundamental (lowest) 
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frequency. A membrane may be excited at a higher harmonic; in 
such a case the surface is broken up by nodal lines. It turns out that 
nodal lines in a circular plate may have a circular form or consist 
of diameters passing through the centre. 

The demonstration of nodal lines by the Chladni method (named 
after the scientist who proposed it) is an effective and simple expe- 
riment. A plate sprinkled with sand is put into a vibratory state by 


Fig. 65 


means of a blow or a fiddlestick. The sand rolls away from the anti- 
nodes and gathers along the nodal lines. Fig. 65 shows several Chlad- 
ni figures. : 

The vibratory state of a solid three-dimensional body is, of course, 
the most complex. We shall avoid consideration of this phenomenon 
in a body of complex form and restrict our study of such free vibra- 
tions to a right-angled parallelepiped. If the standing waves in 
such a body were only due to the superposition of waves travelling 
parallel to an edge of the parallelepiped, the natural frequencies 
of the vibrations would be limited to the values 

nyc Noe n3e 

co ae Pe) 
where ,, m2, 23 are arbitrary whole numbers, and J;, J2, 13 are the 
lengths of the edges of the parallelepiped. 

However, the waves propagating in the body may form, in gener- 
al, any angle with the boundaries. In this case, the standing waves 
are formed after a number of reflections, when the beam returns 
to the exact point from which it left. Such an undulatory process 
may be conceived as the superposition of waves of different frequen- 
cies travelling along different edges of the parallelepiped. Their 
natural frequencies of vibration are: 
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and the wave numbers are 


n no n 
ky= sat, kp =e, hgh. 
ee eee) ee ee 


In superimposing the waves, k is determined by addition, bul it 
should be noted that the addition is vectorial. Thus, 
i272 es c ni n§ nz 
k=Vii-e+k, ie., v=SVetetE: 

It is evident that the frequencies of vibration for the simple cases 
of wave propagation parallel to the edges of the body are also obtain- 
able from this formula; in such a case only one of the three whole 
numbers in the formula is different from zero. 

The vibration spectrum of a three-dimensional body is depicted 


in Fig. 66 in three-dimensional space, which may be called frequency 
space or inverse space. Here, the quantities - ; ir ; om are plolled, 
. aly 2 als 

respectively, along the three axes. Each node in the lattice 
(inverse lattice) thus formed represents one of the natural fre- 


quencies of vibration of the body for the numbers 7, re, 23. The 


s 


Ky 


Fig. 66 


radius vector drawn to a node of the lattice in the inverse space re- 
presents a possible vibration frequency. A sphere of radius v includes 
all points corresponding to frequencies less than v. The volume of 


‘i 4 
such a sphere is equal to suv and the volume of each cell of the 


inverse space is equal to (5) 3/v, where v isthe volume of the body. 
Therefore, the number of free vibrations of a body with frequencies 
less than v (the number of nodes in an octant of the sphere) is ex- 
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pressed by the formula 


This interesting relationship shows that the number of natural 
frequencies increases sharply as the band of frequencies being con- 
sidered increases. For high frequencies, the discrete character of 
the spectrum becomes blurred, for the frequencies are very close to 
each other. 


44. Forced Vibrations of Rods and Plates 


If the vibration of a rod, plate or other body does not take place 
in vacuum but in some medium*, namely, liquid or gas, then 
a fraction of the intensity, depending on the ratio of the wave resist- 
ance of the contiguous media, is transferred from the vibrating 
body to the medium. This idea can briefly be expressed as follows: - 
a vibrating body radiates energy. Due to radiation, the free vibra- 
lions of a rod, string, etc., are rapidly attenuated. If it is required 
that the body be a constant source of radiation, the vibration must 
be excited by an external source. Just as in the case of particle 
vibrations, the energy may be provided either by means of self- 
sustained vibrations or by producing forced vibrations. 

Depending on the method of providing the external energy and 
on its point of application, one can excile, generally speaking, one 
or more of the natural frequencies of a body capable of vibrating. 
One can, for example, produce forced vibrations in a string under 
tension in the following manner. An electromagnet fed by sinusoidal 
current from an audio generator is fixed about a steel wire. The vibra- 
tions of the wire under the action of the periodically varying exter- 
nal lateral force become perceptible only at resonance. By varying 
the tension of the wire and the external frequency, one can show 
that the wire will vibrate at the fundamental frequency as well 
as at various overtones. 

The production of forced vibrations (standing waves) in piezo- 
electric plates and ferromagnetic rods is of great practical impor- 
tance. Such vibrating bodies are generators of ultrasonic waves. 

Ferromagnetic bodies may elongate or shorten under the action 
of a magnetic field. The theory ofthis phenomenon is complex and 
will be treated only briefly in this book. For the present, it is suf- 
ficient to illustrate how the length of a ferromagnetic rod depends 
on the intensity of the field. This is done in Fig. 67, which shows 


- * One must become reconciled to the fact that vibration of a body is used 
in two senses—vibration of a body as a whole and vibration of the particles 
of a body with respect to one another. 
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that nickel and annealed cobalt shorten in fields of any intensity, 
cast cobalt shortens in weak fields but elongales in strong fields 
and, finally, iron elongates in weak fields and shortens in strong 
ones. In any case, a ferromagnetic rod can execute forced vibrations 
when placed in an alternating magnetic field. For this purpose, 
the rod is usually placed in the core of a transformer fed by alter- 
nating current. In order for the 
standing wave in the rod to be of 
sufficient intensity, it is necessary 
to operate under conditions of res- 
NSS ee onance, i.e., the frequency of the 
alternating field should coincide 
with the rod’s natural frequency 
Co (annealed) of vibration. 

Since the rod is fixed at the 
centre, the natural frequency of 
vibrations is 


Co (cast) 


Relative elongation, Aly 10° 
28 & 


' 
iS 


1000 2000 " 
intensity of magnetic field in oersteds v= wre 
Fig. 67 and the rod can vibrate only at 


; “a the frequencies of the odd harmon- 
ics. Substituting the numerical values of the physical constants, 
the fundamental frequency for nickel turns out to be equal to 


2 ae ; 
val ke/s (where 7 is in centimetres). 


Thus, a rod having a length of 40 cm will vibrate at a fundamental 
frequency of 6 kc/s. 

A piezoelectric crystal is most commonly used as a source of ultra- 
sonic vibrations. ’ 


45. Piezoelectric Vibrations 


Any crystal that does not have a centre of symmetry in a number 
of its elements of symmetry (see Sec. 262) may exhibit the piezo- 
electric effect. This phenomenon manifests itself in a change of the 
dimensions of a crystal under the action of an electric field and, 
conversely, in the creation of an electric field in a crystal under the 
action of forces applied to the crystal. When utilising the piezo- 
electric effect as a source of vibrations, we are dealing, of course, 
with the former aspect of the phenomenon, known also as electro- 
striction or the inverse piezoelectric effect. Piezoelectric materials 
include quartz crystals, Rochelle salt, barium titanate, and 
dilhydrophosphate of ammonia. Generally speaking, there are hun- 
dreds of known materials that could, in principle, be used for this 
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purpose. However, addilional requirements, e.g., durability and 
stability wilh respect to moisture as well as the natural desire to 
select crystals that will yield the strongest effect, sharply limit the 
practical choice of material. 

The change in crystal dimensions under the action of an electric 
field differs for different. directions (with respect to the crystal’s 
axes of symmelry). Therefore, different deformations will be obtained 
when, Irom a crystal, we cut rods or plates having different 
orientations with respect lo the crystal’s axes and place them between 
condenser plates. Usually, the quartz plate or other piezo- 
electric material is cul in such a manner that longitudinal displace- 
ments occur in the material when it is placed in an electric field. 
Thus, under the action of an alternating electric field, the forced 
vibrations produce standing longitudinal waves. 

If 7? is the thickness of the plate in the direction of wave motion, 
then, as usual, the natural frequency of vibration is given by the 


ne al & as e . . 
formula v =z" For a quartz crystal having this simple orientation, 


the velocity of the elastic wave is equal to 5,400 m/sec. Hence, the 
fundamental natural frequency of vibration of a quartz plate is 
determined from the formula 


2,700 
vs 


ke/s (2 is in centimetres). 


It should be noted that the measured value is somewhat different, 
namely, 2,880/1 ke/s. 

The vibration amplitudes depend on the magnitude of the applied 
field, whereby a linear dependence exists between the displacement 
magnitude and the electric field intensity. It is not uncommon to 
use large field intensities. Since quartz is an excellent insulator, 
electric fields of the order of 30,000 volls/em find application for 
thicknesses up to a centimetre. 

To attain a powerful ultrasonic signal, use is made of the reso- 
nance effect. This is essential because resonance displacements are 
thousands of times greater than displacements under the action of 
static fields, and furthermore, the vibration energy is proportional 
to the square of the displacement. 

By gradually increasing the frequency of the generator, one can 
successively excile all the overtones of the crystal. The frequency 
range of commercial ultrasonic generators extends from hundreds 
to thousands of kilocycles. 


CHAPTER VIII 
ACOUSTICS 


46. The Objective and Subjective Nature of Sound 


Man can perceive the loudness, pilch and timbre of sound by 
means of his hearing organs. The electronic oscilloscope enables us 
to investigate the objective and subjective nature of sound in detail. 

Since sound is the result of a vibratory process taking place in 
air, it may be completely described by a curve showing amplitude 
change (immaterial whether displacement, vibrational velocity or 
pressure) with respect to time. Such a curve enables us to establish 


Fig. 68 


whether the process is periodic and, if so, to determine the funda- 
mental tone of the vibration. By studying the periodicity, the over- 
tones present and their amplitudes may be determined. In other 
words, the curve showing the dependence of the vibration on time 
always enables us to find the spectrum of the vibration, i.e., lo 
establish which frequencies are present and the amplitudes of these 
frequencies in the spectrum. The curve is obtained by means of 
a microphone connected to an oscilloscope. In more claborate ar- 
rangements, the curve of the vibration is automatically converted 
into its spectrum. 

A simplified diagram of such an analyser is shown in.Fig. 68. 
The inpyt sound is converted by a microphone into electrical 
current, is amplified and applied to an apparatus consisting of 
a large number of filters, each of which passes a specific band of 
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frequencies, e.g.. = of an octave (36-48, 48-60, 60-72 cps, etc.). 


The filters resolve the signal into its spectrum; the narrower the 
frequency band of each filter the greater ils resolving power. Each 


~ 
S 


Amplitude 
S 
pas 


0 
0 500 f000 1500 2000 2500 3000 3500 
Piano Frequency 
10 
& 
u 
= 
305 
Q 
iS 
= 
0 500 jooo 1500 2000 2500 3000 3500 
Clarinet Frequency 
Fig. 69 


portion of the spectrum passing through a filler is fed via a commuta- 
tor to an amplifier and detector (rectifier). The output of the detec- 
tor is applied to the oscilloscope plates that deflect the electron beam 
in the vertical direction. If a voltage is not applied to the second 
pair of oscilloscope plates, then, upon connecting each of the filters, 
the vertical deflection of the electron beam will be proportional 
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to the amplitude of the corresponding frequency component of the 
spectrum. However, the arrangement is greatly improved by con- 
necting a voltage to the second pair of plates that provides lori- 
zontal sweep of the electron beam. This is done by properly syn- 
chronising the rotation of commutator 2, which provides the 
sweep voltage, with the automatic rotation of commutator 4. Thus, 
the amplitude of the component passed by each filter is displayed 
for a different, but unique, horizontal displacement of the electron 
beam. As a result, the complete spectrum is displayed on the oscil- 
loscope screen. 

Periodic vibrations have discrete spectra, while nonperiodic 
vibrations have continuous spectra. Musical sounds are illustrative 
of the former, while various kinds of noise illustrate the latter. 

One and the same musical tone played on different instruments 
will have the same fundamental frequency, but will have different 
spectra. The quality of the sound is determined by the distribution 
of the overtone intensities (see Fig. 69). In the musical sense, the 
more complex the spectrum the richer the quality of the sound. 
It is interesting that the phase of the overtones (see the formula 
on p. 105) does not affect the subjeclive perception of sound. The ear 
can only distinguish between the intensities of the overtones. 

Noise analysis is of great practical importance. If the noise fre- 
quencies of largest intensity are known, the cause of the noise is 
more easily determined and, consequently, more easily eliminated. 


47. Intensity and Loudness of Sound 


In Fig. 70, the heavy lines mark the limits of the region of audito- 
ry perception for the average person. Two uniquely related quanti- 
ties are plotted along the ordinate, namely, the amplitude of sound 
pressure and the intensity of sound. The sound pressure p and the 
sound intensity J are related, in the simplest case, by the formula 


9 t 
p* 
2pe 


We know that the intensity of the wave is 
f= we, 


where w is the energy density, i.e., wae . But w= f (see p. 117). 


Hence, substituting, we obtain the above formula. The intensity of 
sound may be measured in walts/cm?. 

Very intense sounds produced by a pressure of about 2,000 bars 
cause a sensation of pain. Very weak sounds can still be perceive 
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by the average person when they have a pressure of 2 x 410-7 bar 
(1 bar = 1 dyne/cm®). Since for air tes 41, we obtain for the limits 
of sound intensily the values 0.5 x 408 ergs/sec cm? (=0.5 X 10-* 
watls/cm*) and 0.5 x 10776 watis/em?. 

This large range of intensities makes it convenient to introduce 
a logarithmic scale. If the intensity of one sound is J; and the inten- 
sily of another Jz, we say that J, is K decibels louder than J; when 


K =10 log 


The quantity K is called the loudness level. Thus, if the sound 
intensities differ by a factor of a million, they differ in loudness 
by 60 decibels. 

When expressing the sound intensity in decibels, it is necessary 
to indicate the zero level. The value usually selected for this level 
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is close to the threshold of audibility (10-?° watts/cm*). A whisper, 
then, has a loudness of about 15 db and the noise of an airplane 
about 120 db. 

Returning to the diagram of auditory perception, we see that the 
region of speech is quite restricted in frequency (100 to 10,000 cps) 
as well as in intensity (40 to 80 db). Sounds of different frequency 
have different audibility. The human ear perceives frequencies of 
several thousand cycles per second best of all. Below 20 cps lies 
the infrasonic region and above 10,000-20,000 cps the ultrasonic. 

The table gives approximate values for the sound pressure p, the 
intensity J and the loudness x. 
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| Pp I WK 

(bars) (walts,’em?2) (db) 
Threshold of audibility ...... 2.9x 1074 10-16 0 
Drip of thawing snow. ...... 2.9x 10-3 10-14 20 
Low conversation at a ‘distance of 

S metres .... 2.40.0 e eee | 29K10-2 10-12 AQ 
Symphonic orchestra (fortissimo) 2.9 10-8 8U 
Airplane engine at a distance of 

3 metres ......2.-6+ 22 es 290 10-4 120 


48. Architecture and Acoustics 


In some auditoriums speech is unintelligible even though suffi- 
ciently loud, while in others the speaker must raise his voice in 
order to be heard. Let us investigate the physical parameters of an 
auditorium that determine its acoustic properties. 

Experiments show that the most important factor of this nature 
is the so-called reverberation time, the time in which a sound decreases 
to one-millionth of its original intensity. With respect to acous- 
tics, an auditorium is best when its reverberation time t is ().5- 
1.5 sec. If t is less than 3 sec, the auditorium is considered to be 
good. If the reverberation time exceeds 5 sec, the acoustics are very 
bad, being characterised by “resounding”. 

A sound uttered at some part of a large hall is reflected- from the 
walls, floor and ceiling, the furniture and drapes, and from the 
clothes of those present. If for each reflection the sound loses a large 
part of its energy, the sound will be attenuated very rapidly. The 
reverberation time in this case is very small and the sound will be 
“dull”. Resounding occurs when the sound is repeatedly reflected 
with little attenuation. The listener will perceive the direct wave, 
the wave after one reflection, two reflections, etc. If the interval 
of time between the arrival of these sound waves does not exceed 
1/15] of a second, the ear will not perceive two or three distinct 
sounds as in the case of echoes, but rather a prolonged, and hence 
unclear, sound. 

It is evident that the time attenuation of sound is determined 
by its absorption in the surrounding bodies. Since the sound is 
repeatedly reflected, after a short time of constant sounding from 
some source the auditorium will more or less uniformly fill up with 
sonic, i.e., vibratory, energy. Within a short period of time, equi- 
librium is established between the energy delivered by the source 
and the energy absorbed by the medium. It should be noted, inci- 
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dentally, that in the absence of absorption the sonic energy in a 
closed room would increase without limit for continuous sounding 
of a source. ; 

If the sound source is interrupted, then the phenomenon reduces 
to the absorption of the sonic energy by the surface of the bodies 
located in the room. Each of the materials involved in this process 
has its own characteristic coefficient of absorption a. If there is an 
open window in the room, the absorption coefficient may be as- 
sumed to be equal to 1, since the sound completely leaves the room, 
and this is equivalent to being absorbed. For a smooth, solid wall 
the coefficient a is almost zero (for concrete, it is 0.015). Now, the 
sound absorption for the entire room may be described by the ex- 
pression A = aS; + GS. + a3S3-+ ..., Where the sum takes 
into account all the surfaces in the room. Theory shows that the 
reverberation time depends on the quantity A and the volume of 


: 
‘the room V, i.e., tT = 0.16 +. In this formula, the volume is ex- 


pressed in cubic metres and the quantity A in square metres. 

By means of these formulas, it is not difficult to calculate the 
reverberalion time. The absorption coefficient for concrete is given 
above; for glass, wood and plaster, it is not much larger (up to 3 per 
cent). A sharp increase in absorption occurs when soft materials 
are brought into the room. Suffice it to note that the clothing of 
one person absorbs as much sound as 20 square metres of wall sur- 
face. For soft materials, the coefficient of absorption varies between 
0.5 and 0.9. A large role in the solution of acoustical problems in 
the construction industry is played by porous materials (e.g., spun 
glass, and porous concrete), whose coefficients of absorption approach 
the values of a for soft materials. 


49. The Atmosphere and Acoustics 


When a wave passes from one medium into another, it changes 
its direction of propagation in accordance with the law of refraction. 
The angle by which the direction of propagation changes is deter- 
mined by the index of refraction, i.e., the ratio of the velocities 
of propagation. 

It was indicated in Sec. 32 that the velocity of sound propagation 
is sensitive to changes in temperature. A temperature increase of 
4°C increases the sound velocity by about 0.5 m/sec. The temperature 
of different layers of the Earth’s atmosphere has, as a rule, different 
values. Thus, in different layers of the atmosphere, sound will have 
different velocities. How is the propagation of sound affected by 
the fact that the sound travels in a medium in which the refractive 
index is continuously changing? 
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Let us answer this question by referring to Fig. 71. Assume that 
the sound passes through a series of layers and that in each layer 
the refractive index is constant but changes abruptly from layer to 
layer. The path of the sound wave is represented by the broken line. 
If the thicknesses of the layers are small and the differences in the 
refractive indexes begin to decrease, the broken line will approximate 
a curved line. Thus, in a medium of variable index of refraction, 
sound waves propagate, generally speaking, along curved lines. 
Moreover, the path is always such 
that the wave travels from point 
to point in the shortest time. 
This proposition is known as 
Fermat’s principle. A_ straight 
line, in this case, is in a certain 
sense not the shortest. 

We shall demonstrate the va- 
lidity of this principle for the 
case of two adjacent segments 

Fig. 71 of the broken line just consid- 

ered. Let us assume, for simplic- 

ity, that the thickness d is the same for both layers and that the 
propagation velocities v, and v2 are different. The time required 
for a wave to traverse the path indicated in tee figure is equal to 


t= VEPFE LLY a Fe. 


Here, the time is expressed in terms of the independent variable cr. 
For different values of x, the refraction will differ and so will the 
time of travel from the initial point to the final point. The least 


time will be taken when the condition a = 0 is satisfied, i.e., when 


vj ms x b a—rzr 
x a—-Zr 
But Vx2+a2 V@—22 +a 
sine of the refraction angle. This proves that the refraction of the 
wave occurs in such a manner that its time of travel is a minimum. 
It should be emphasised that this result is valid not only for elastic 
waves but for all undulatory processes. 

Thus, a wave travelling in a nonhomogeneous medium changes 
its direction in such a manner that its path is lengthened in a medi- 
um in which the propagation velocity is larger and shortened in 
a medium in which the propagation velocity is smaller. In other 


is the sine of the incident angle and is the 
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words, a wave in a layer where the propagation velocity is larg« 
will tend to travel parallel to the layer, while in a layer where the 
propagation velocity is small it will tend to travel perpendicula: 
to the layer. 

This is clearly illustrated in Fig. 72. Here, the path of a sound 
wave is schematically shown for the case when the temperature 
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of the air decreases with height (usual day-time condition) and for 
the case when the temperature increases with height (night-time 
condition). 

In the former case, the velocity of sound propagation is large in 
the layers close to the Earth. If we trace the propagation of the 
sound wave emanating from a point above the Earth’s surface at 
a small angle with the vertical, the following situation is seen tc 
prevail. Each of the successive layers deflects the wave further and 


further away from the vertical. When the angle of incidence 


sini a 
= 1, refraction ceases 


becomes equal to the angle ig, for which 


and total reflection occurs. Formally, the reason for total reflec- 
tion is clear, namely, sin i cannot become larger than unity. 
The physical basis of this interesting phenomenon will be consi- 
dered below (Sec. 128) in connection with electromagnetic waves. 
In any case, instead of being propagated along the I[arth’s 
surface, the wave is turned in an upward direction. The diagram 
makes clear how “zones of silence” are formed. At night, the path 
of a sound wave is turned convex upwards. As a result, audibility 
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at night is much better than during the day. When sound propagates 
over a reflecting surface (a still body of water), it can be heard for 


Fig. 73 


several kilometres even when its intensity is relatively low. The 
path of such a wave is represented by a series of consecutive convex 


arcs (Fig. 73). 
50. Ultrasonics 


The vibratory energy in a unit volume of a sonic field is proportion- 
al to the square of the frequency. Thus, the density of the vibrato- 


ry energy is wah , but the amplitude of the velocity is uw = Ao, 


so that w is proportional to w*. A powerful ultrasonic source is capa- 
ble of producing vibrations with a pressure amplitude of dozens 
of atmospheres. This means that in small volumes of matter, we 
go through the following cycle several thousand times per second: 
up to dozens of atmospheres of compression, down to zero, then up 
to dozens of atmospheres of expansion, etc. 

It is evident that a powerful mechanical action of this nature 
may have a number of specific effects. One such effect is cavitation. 
At the instants of vibration corresponding to maximum expansion 
in a liquid located in an ultrasonic field, microscopic explosions 
occur and dissociated gases and steam rush into this region. At the 
instants of vibration corresponding to compression, tremendous pres- 
sures of the order of thousands of atmospheres are produced in the 
regions of these explosions. 

This powerful force may be used to overcome the forces acting 
between molecules. Emulsions such as fat in water and benzene in 
water become dispersed under ultrasonic action. Sooner or later 
cavitational explosions occur. in the suspended particles. This disin- 
tegrating action has found wide application in industry. 

However, ultrasonic action may be of considerable importance 
even when cavitation does not occur. Thus, if an ultrasonic wave is 
passed through aeorsol (a suspension of solid particles in a gas, e.g., 
smoke), the particles are precipitated out. The vibrations cause the 
solid particles to gather at the sound pressure nodes, where the 
particles merge and become sufficiently heavy to fall to the ground. 

Finding blowholes, internal cracks and other defects in metals by 
means of ultrasonic irradiation is another important field of appli- 
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cation. The method is based on the reflection of such a wave from 
the boundary between the medium and air, i.e., between the metal 
and the inclusion. Only if the dimensions of the defect are greater 
than a wavelength will the method work. In order to detect a defect 
having a dimension of 4 mm, the wavelength should be less than 
0.4 mm, i.e., a frequency of the order of 10° cps. The frequencies 
used are usually much lower (i0’ cps), the method being employed 
to detect large flaws. : 

As is well known, ultrasonics also finds application in echo sound- 
ing and underwater location. 


CHAPTER IX 
TEMPERATURE AND HEAT 


51. Heat Equilibrium 


When all the properties of a body remain unchanged, we say 
that the state of the body has not changed. On the other hand, when 
some property of the body changes, its state changes. The state of 
a body may be changed by doing work on it. However, the same 
results may be achieved without using mechanical means. Water 
is heated by intensely stirring it or by placing it on a gas burner. 
Heat exchange is said to take place when the external medium or 
surrounding bodies act on a body or system of bodies under consid- 
eration so as to change the state of this body or system of bodies 
by nonmechanical means. 

If there is no heat exchange between the bodies, the bodies are 
in thermal equilibrium and have the same temperature. The presence 
of thermal equilibrium can be directly verified by bringing the 
bodies into contact with each other: whereupon the states of the 
bodies after contact should not differ from the states before contact. 
However, heat exchange is also possible when bodies are far apart. 
Thermal equilibrium may be detected, in this case, by means of 
a third body acting as a thermometer. If the thermometer is in 
equilibrium with both bodies, the temperature of these bodies is 
the same. This means that they would also be in a state of thermal 
equilibrium when in direct contact. By means of a “third body”, 
a thermometer, it can always be ascertained whether bodies have 
equal or different temperatures. 

By means of a thermometer, we can establish not only whether 
thermal equilibrium prevails or not, but also the extent of a partic- 
ular deviation from equilibrium. To obtain a suitable thermometer, 
it is first necessary to agree on the type of thermometer (mercury, 
alcohol, water or gas) and the property (indication) by which we 
shall judge whether thermal equilibrium has been achieved between 
object and thermometer. As always in physics, it is important to 
agree on what instruments, in this case thermometers, will be con- 
sidered primary. A thermometer can, then, always be calibrated 
by means of a standard. Gaseous hydrogen is the material used in 
a slLandard thermometer and the gas pressure p is the property by 
which the temperature is determined. The temperature of a body 
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is taken as proportional to the hydrogen pressure in a gas ther- 
mometer at the constant.volume occupied by the hydrogen. 

The temperature scale is selected as follows. We call the temper- 
ature of the melting point of ice 0° and that of the boiling point 
of water 100° (at a pressure of 760 mm of mercury). Measuring the 
hydrogen pressures Po and Pyoo at these two points, and drawing 
a straight line through the plotted points, we obtain the Celsius, 
or Centigrade, scale. The equa- 
tion of this line, shown in 
Fig. 74, has the form 

p- PPO ye 100. 
Pi00 — Po 

The straight line intersects 

the t-axis at a temperature 


of —273.4°C. This is abso- ~273/% 0% t 
lute zero. By definition, lower ee 
temperatures are not pos- Fig. 74 


sible. In physics, we often 

use a temperature calculated on the basis of absolute zero, namely, 
T =t-+ 273.1°. This is called the absolute temperature or the tem- 
perature in degrees Kelvin (°K). 

When we calibrate working thermometers with respect to a hydro- 
gen standard, only a limited interval of temperatures may be used. 
At high temperatures, diffusion of the hydrogen through the walls 
of the vessel may begin to occur. At low temperatures, the hydrogen 
may liquefy. Nevertheless, the adopted method of determining 
temperature has complete general validity as will be shown below 
(p. 160). 


52. Internal Energy 

The basic characteristics of bodies in the presence of mechanical 
and heat interaction is very well depicted by the so-called kinetic 
molecular model. A body consisting of molecules is considered as a 
system of moving and interacting particles subject to the laws of 
mechanics. Such a system of molecules has an energy consisting of 
the potential energy of the interacting particles and their kinetic 
energy of motion. This energy is called the internal energy of the 
body. 

A specific internal energy corresponds to a specific state of the 
body. Changes in the mutual disposition or character of particle 
motion are related to changes in internal energy. Irrespective of the 
means employed to increase the internal energy of a body, the sur- 
rounding bodies must transfer energy to the molecules of the body 
under consideration. If the body is subjected to mechanical action, 
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the energy transfer occurs in a regular manner. In the case of heat 
exchange, the energy is transferred through chance impulses trans- 
mitted now to one, now to another molecule. 

The quantity of energy transferred to a body by mechanical means 
is measured by the amount of work done on the body. The quantity 
of energy transferred through heat exchange is measured by the 
quantity of heat. 

Since the exact calculation of the internal energy of a body is very 
difficult and in most cases impossible, and since the very conception 
of internal energy as a purely mechanical quantity is only a rough 
one, a Clear determination of this quantity is necessary. This can 
be done by considering processes occurring without heat exchange 
with the surroundings, i.e., so-called adiabatic processes. Adiabatic 
conditions can be provided by using a thermally insulated container 
for the experiment and taking the measurements during short inter- 
vals of time (so that the heat does not have time to “escape” from 
the volume under study). Numerous experiments leading to the 
establishment of the law of conservation of energy show that, irre- 
spective of the means employed to change the state of a body in such 
a process, the amount of work required is exactly the same in each 
case. The magnitude of this work, A, is equal by definition to U, 
the increment of the internal onergy of the body: 


yes 6 Peed 6 fe 


Naturally, the absolute value of the internal energy cannot be deter- 
mined from the experiment. 

If the mechanical model of a body were a completely faithful 
representation, the above expression would be a simple consequence 
of the law of conservation of mechanical energy. However, the kinet- 
ic molecular model is only a model and, therefore, the fact that 
there is a specific energy corresponding to each state of a body, so 
that the difference in energy between two states is equal to the 
adiabatic work of transition, represents an extremely important 
law of nature leading to the law of conservation of energy. 

Heat exchange and mechanical action can lead in a number of 
cases to the same change of state, i.e., to the same change in the 
internal energy of the body. This enables us to equate heat and work 
by measuring the quantity of heat in the same units as work and 
energy. 


To obtain an idea of the magnitudes of various internal energies, let us 
cite some figures 

When the: temperature of 1 gm of water is raised by 1°, the energy in- 
creases by 


1 calorie ==0.427 kg-m=4.18 X 107 ergs = 2.61 x 1019 ev. 
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In this, case, the average increase in energy of a molecule is 
3 x 10723 calorie = 1.28 x 10-23 kg-m = 1.25 x 10715 erg = 7.83 x 1074 ev. 
The internal energy given up by matter in the combustion of 1 gm of coal 


amounts to 
7,000 calories =: 2,990 kg-m = 2.93 x 1011 ergs = 18.3 x 1022 ev. 


Calculating this on the basis of one atom of carbon, this figure reduces to 
1.4 x 10-19 calorie = 5.98 x 10-20 kg-m = 5.86 « 10-12 erg = 3.66 ev. 
The energy released in the nuclear fission of 1 gm of uranium-235 is 
2.03 x 1010 calories = 8.65 x 109 kg-m = 8.49 x 1017 ergs = 5.29 x 1029 ev. 


Calculating this on the basis of one atomic nucleus, the internal energy given 
up amounts to 


7.9 x 10-!2 caloric = 3.38 x 10-12 kg-m=3.3 x 10-4 erg = 
= 206 x 108 ev = 200 Mev, 


which is more than 50 million times greater than the energy of chemical re- 
actions. 


53. The First Law of Thermodynamics 


In the most general case, when energy is exchanged with the 
medium or with surrounding bodies, the system under consideration 
may receive or give up a quantity of heat Q and perform or have 
performed on it a given quantity of work A. Heat and work are the 
two forms in which the energy of a body may be transmitted to the 
medium or, conversely, the energy of the medium may be transmitted 
to the body. The law of conservation of energy excludes the possi- 
bility of any loss in the energy exchange. The difference in the ener- 
gies of the system for the two states must equal the sum of the heat 
and work obtained by the system from the surrounding bodies. 

This proposition could not be subjected to experimental verifica- 
tion if we did not add that the incremental energy due to the tran- 
sition from one state to another is always the same irrespective of 
the character or method of transition from the initial to the final 
state. It is precisely this provision that embodies the law of conser- 
vation of energy. Now, it can clearly be subjected to all-sided exper- 
imental verification: by measuring the heat and work imparted 
to the system for various transitions from a particular initial state 
to a particular final state. The incremental energy in all cases will 
be identical. 

The law of conservation of energy expressed in the above concrete 
form is called the first law of thermodynamics. This very important 
jaw of nature was established as the result of the work of a number 
of scientists in the middle of the last century. The roles played by 
Robert Mayer, Joule and particularly Helmholtz rank especially 
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high on the list. As physics developed, the relationship of the first 
law of thermodynamics to the more restricted law of conservation 
of mechanical energy and the more general law of conservation and 
transformation of energy became ever clearer. During the early 
development of physics, there was a tendency to equate the law of 
conservation of mechanical energy with the first law of thermodynam- 
ics. The growth of science has shown that such a simplified view 
is erroneous, that many forms of energy do not reduce to mechanical 
energy. It was noted by Engels that the law of conservation and 
transformation of energy, which indicates that any form of motion 
may be transformed into any other, subject to the preservation of 
certain quantitative relationships, is the most general law of exist- 
ence of matter. The first law of thermodynamics is a concrete expres- 
sion of this law. 

In order to write the first law of thermodynamics in the form of 
a formula, we must first agree on the choice of sign for heat and 
work. Let us assume that heat is positive when it is imparted to the 
system and work is positive when a body performs it against the 
action of external forces. The first law of thermodynamics may then 
be written in the form 


AQ = dU + AA, 


i.e., the heat applied to a body goes to change the internal energy 
and perform the work of the body. Naturally, each of the quantities 
entering into the equation may be positive or negative depending 
on the particular transformation being considered. 

It is not accidental that in writing the above equation the differ- 
ential sign was used only for the energy. Work and heat are not 
total differentials. When a body goes from one state to another, 
the work and heat received or given up by the body depend on the 
“path” traversed, and only the energy increment, as in the case of the 
total differential of a function, does not depend on the manner of 
transition: 


2 
ete 


The law of conservation of energy and, in particular, the first 
law of thermodynamics apply in all branches of physics. Science 
is enabled to make particularly valuable predictions on the basis 
of this law. Without knowing anything about the nature of a par- 
ticular process except the initial and final states of the system under 
consideration, a number of important conclusions may be drawn. 
For example, assume that molecules A and B are united by a chem- 
ical reaction to form the molecule AB. Assume, further, that U4, 
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Uy, and Uap, the internal energies of the molecules, are known. 
If Uay is greater than U, + Uz, we are able to predict that the 
reacLion will proceed with the absorption of heat and the quantity 
of heat will be equal to Q = Uyg — (Ua + Uy). Or, if we know 
U, and U;, and measure the heat of the reaction by means of 
a calorimeter, we can determine U,,. These data can then be used 
to predict the course of some other reaction involving the com- 
pound AB. 


54. The Internal Energy of Microscopic Systems 


Naturally, the law of conservation of energy and the principles 
of energy exchange are valid for large bodies as well as for the indi- 
vidual particles of a body. However, when considering particles 
(nuclei, atoms and molecules), or systems consisting of a small 
number of particles, one other important law of nature must be 
taken into consideration, namely, the energy of a microscopic system 
Cannot assume any arbitrary value. Each system has a sequence 
of possible values of internal energy, U;, U2, ..., that is character- 
istic of it alone. Figure 214 (p. 487) shows the possible energy levels 
for a hydrogen atom. Similar diagrams may be drawn for the energy 
levels of other atomic systems. When imparting heat or work to 
a system, the energy of the atoms, molecules or other microscopic 
systems may increase only by specific, discrete amounts (quanta). 
In exactly the same manner, energy is given up to surrounding 
bodies in quanta. 

Strictly speaking, the law of the quantum character of energy 
and the existence of a “scale” of possible energy levels for each mi- 
Croscopic system is a perfectly general law of nature that is valid 
for large bodies as well. However, as is shown in theoretical physics, 
the number of energy levels in a large body consisting of z atoms 
is, roughly speaking, 2 times the number of energy levels in a single 
atom. 

As the energy increases, the intervals between levels become small- 
er and smaller (see the diagram for hydrogen). The reduction in the 
interval between energy levels is incomparably more rapid for 
a large body than for an individual atom and only the very lowest 
levels appear discrete. The higher levels merge and it appears as 
though a large body can change its energy continuously. If energy 
is taken away from a body, it “descends” to a lower level. Hence, 
the lower the temperature of a body, i.e., the closer it is to absolute 
zero, the sharper the quantum character of the energy changes. 

Mechanical action serves to shift the energy levels of a body or 
system, but in the overwhelming majority of cases this displacement 
cannot be observed. For microscopic systems—atoms and mole- 
cules—the effect of pressure is very small. 
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Thermal interaction consists in the transitions of a system from 
one energy level to another. 

Thermal equilibrium is mobile equilibrium. A body does not have 
a single energy all the time, but is rather continuously exchanging 
energy with its surroundings, so that on the average the energy 
remains unchanged. The exchange of energy occurs in discrete amoun|s 
or quanta. If at one instant the energy is equal to U;, the next 
instant it has changed abruptly to Uy. . 

Energy is given up in the form of radiation. If U; > Uz, then 
O, — Uz, = bv, where v is the radiation frequency and h is Planck’s 
constant. This constant is equal to 6.62 x 10-*’ erg sec. Energy may 
be gained by absorbing radiation or as the result of a mechanical 
impulse from some particle. 

If the temperature of a body drops instead of remaining constant, 
this signifies that the number of transitions from higher to lower 
levels exceeds the number of transitions in the reverse direction. 
The energy decreases in jumps and the body emits one quantum of 
radiation after another. 

The diagrammatic representation of energy exchange was origi- 
nally carried out only for atoms. Somewhat later, it became evident 
that this representation has general validity. We refer the reader 
to Part III of this book for further details on this subject. 


55. The Equation of State 


Three basic properties or parameters of state may he selected from 
the various properties of a body. These are the pressure p, the vol- 
ume v and the temperature 7. Knowing these parameters is not always 
sufficient to exhaustively describe a body. If a system consists of 
various substances, we must also know their concentrations. If 
a body is located in an electric or magnetic field, we must know 
the intensity of the field. However, it is always possible to select 
a group of parameters that will uniquely determine the state of 
a body. The other characteristics may then be calculated from the 
basic parameters. 

Leaving electromagnetic fields out of consideration and restricting 
ourselves to simple systems—gases, liquids and isotropic solid 
bodigs—it turns out that only two parameters determine the state of 
a bag It is immaterial which pair of parameters are selected from 
p, Vand 7. Usually, v and T are selected. The pressure p is then 

-- a fiuction of v and 7. We call the equation 


p=fv, L) 


"the equation of state. It is of very great importance in physics to be 
.. alfle to determine this equation for a body and, in particular, for 
a 
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a class of bodies. Equations of state may be established only exper- 
imentally. The nature of the dependence of pressure on volume 
and temperature for liquids and solid bodies varies tremendously 
from case to case. By establishing the equation of state for a given 
body, we are able to determine its behaviour under a variety of 
conditions, but this does not in any way enable us to determine 
the behaviour of other bodies. 

Quite often, the behaviour of a substance is not described by an 
equation of stale, but rather by the derivatives of certain of the 
parameters with respect to the others. 

To establish how a body expands with increasing temperature, 


2 d i as 
at constant pressure, we must determine (+), (the derivative of 
v with respect to 7 at constant pressure). The quantity 
=— ( Ov ) 
eo CF) p 
is called the thermometric coefficient of dilation. As can be seen 
from the formula, « gives the relative change in the volume of 


a body for a 1° change in temperature. 
The thermometric coefficient of change of pressure 


| (9p 
B = P ( or i, 7 
gives the relative change in the pressure for a 1° change in tempera- 
ture (at constant volume). The dimensionality of the coefficients 


and f is given in reciprocal degrees. 
The third useful quantity is the compressibility 


which gives the relative decrease in volume for a unit increase in 
pressure (at constant temperature). 

These three coefficients are connected by a very interesting rela- 
tionship, which is easily derived as follows: Since 


p=fv, T), 
then 
dp = (35), aT - + (3), dv. 


If the pressure is constant, then dp = 0 and 


(sr), (47), (Se),=— 


Bx 
a 


Hence 


1 
" . 
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This result shows that if we know, for example, the compressibility 
and the thermometric coefficient of change of pressure, we can cal- 
culate the value of the thermometric coefficient of dilation. The 
derived relationship is valid for all bodies. 

Generally speaking, the coefficients «, B and * are not constant 
quantities for a given substance. For various pressures and tempera- 
tures, these coefficients may assume various values. Hence, when 
the value of a coefficient is given, it is necessary to indicate the 
corresponding values of pressure and temperature. In some cases, 
the average value of the coefficients is given over a particular inter- 
val of temperatures or pressures. 


Here are some examples: 


a) The table gives the thermometric cvefficient of dilation @ and the com- 
pressibility » for some liquids. 


a (deg-1) % (atm!) 
Water, 10°-30°C, normal pressure . 2.07% 1074 47.5 10-6 
Mercury, 10°-30°C .. 2... 2... 1.81% 10-4 2.95x 10-6 
Ether, O°C ...........4. 16.56 10-4 146 10-6 


For solid bodies, the thermometric coefficient of dilation and the compres- 
sibility may vary considerably. Thus, at normal temperature and_ pressure, 
for fused quartz, «@ = 1.29 xX 10-® deg™? and x= 2.7 * 107%, while for 
ebonite, @ = 77 X 107% deg"! and x = 18 K 1078. 

b) The values given in the table for B, the thermometric coefficient of change 
of pressure, are calculated for water, mercury and ether at atmospheric 


px ) 
pressure ( z 


| Water Mercury | Ether 


Pile gh 42-8 ee aos cho aele a e S | AA | 61.4 11.3 


This means that when the temperature of a constant volume of mercury is 
increased by 107% degree, its pressure increases by 6 per cent (!). 


56. The Equation of the Gas State 


The simplest equation of state is that of a rarefied gas. It was 
obtained by Mendeleyev in the form of a single formula that com- 
bines Clapeyron’s equation and Avogadro’s law. Clapeyron’s equa- 
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pe 


tion states that + is a constant for a given mass of gas: 


i = const. 

Avogadro's law states that gram molecules of different gases at 
constant temperature and pressure occupy equal volumes (22.41 
litres at a temperature of O°C and a pressure of 1 atmosphere*). 
Hence, for one gram mole, the constant in Clapeyron’s equation 
is a universal constant. It is designated by the letter R and is called 
the universal gas constant. For a mole of any gas, the equation 
assumes the form 


pv=RT. 


Here v is the volume of a mole of gas. The constant A has the dimen- 
sions of work per mole degree. Expressed in various units, its value is 
= 4. > ergs _ 4 joules = 
Rage SiR? mole in mole deg 
= 94 atm litres _9 calories 
0.0824 mole deg ~~ mole deg * 


Since the volume of an arbitrary mass of gas is V = pv, where 
yu is the number of moles, the equation of state for a rarefied gas 
assumes the following form in the most general case: 


pV =wRT or pV =—— RT. 


Here mm is the mass and AZ is the molecular weight. 
This equation yields the following convenient formula for the 
gas density p: 


Gases obeying the equation of the gas state are called ideal gases. 
The simplicity of the equation would in itself be sufficient grounds 
for using this term. However, as we shall see below (p. 193), this 
equation may be derived by assuming the gas to be represented 
in an ideal system. An ideal gas is then a system of molecules whose 
dimensions and forces of attraction may be neglected. 

For ideal gases, the coefficients of dilation, change of pressure 
and compressibility are given by the following simple formulas: 

OB =+ ace os ‘ 

* A physical atmosphere is meant here (1 physical atmosphere = 1.033 

engineering atmospheres). 
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At a temperature of 0°C (7 = 273.13°) 
=P =sa75 deg? = 0.00366 deg”. 


The following data show to what extent certain actual substances 
approach the ideal condition: 


a@at V=const 


Hydrogen .......... 3,660 x 10-6 
Helium ........... 3,660 x 10-6 
Nitrogen .......... 3,674 10-6 
Carbon dioxide ....... 3,726 x 10-6 
Ao. ce ts, chet, oh ce igen 28 Srey 3,674 % 10-8 


Gaseous substances under a pressure that considerably exceeds 
atmospheric cease to obey the formulas of an ideal gas. Calculations 
may lead to errors of several per cent at pressures of only tens of 
atmospheres. 

An important conclusion to be drawn from the study of rarefied 
gases is that, generally speaking, any of these gases—and not only 
hydrogen—may be used as a basis for determining temperature. 
Hydrogen provides no particular advantage over other rarefied 
gases. It can, therefore, be said that the temperature scale adopted 
in physics is not a hydrogen scale, but rather a scale of pressures 
for an ideal gas. Herein lies the advantage of the method adopted 
for determining temperature, namely, the existence of a large class 
of substances leading to temperature scales that are identical. The 
kinetic molecular basis for the selection of the temperature scale 
will be given below (p. 193). 


57. The Equations of State of Actual Gases 


The equation of the gas state begins to yield very rough results 
for gases at high pressures, steam. close to saturation and a number 
of other cases. Other equations of state are, therefore, required for 
such cases. Some are determined experimentally, while others (the 
most well known being Van der Waals’ equation) have, qualita- 
tively, a theoretical basis. In any case, the validity of one or another 
equation can be established only through comparison of the results 
calculated by means of the equation with the results obtained exper- 
imentally. We shall now give some examples of equations of state. 


Naturally, the simplest correction that can be introduced into 
the equation for ideal gases takes account of the volume of the gas 
molecules. It is evident that a gas cannot be compressed to zero 
volume even if the pressure is infinitely large. Hence, the equation 
of stale may be written in the form 


p(v—b=R7, 


where b is a constant that takes account of the finite volume of the 
molecules. 

The greater the number of constants introduced into the equation 
of state the easicr it is to achieve close agreement between experi- 
mental and calculated values, but the more difficult it is to predict 
changes by means of the formula. Excellent agreement with experi- 
ments over a broad interval of values for the parameters of state 
is obtained by means of the formula proposed by Beatie and Bridge- 
man. It contains five constants—A, B, a, b and c—descriptive of 
the substance: 


p= P98 v4 BY) F 


where 
Cc 


, 3 a ara / b pans 
A’=A(i1—<), B =B(1-2), e=>R- 
Dieterichi’s formula contains three constants—a, D and s: 


peaD RF. CO. 


Van der Waals’ equation contains two constants—a and 6: 
(p+) (v—b) = RT. 


The merit of the last equation is that it correctly reflects the general 
character of the dependence between the parameters for all gaseous 
substances. However, for a given substance, it is not possible to 
select constant values for a and b in such a manner that the calcula- 
tions agree closely with measurements over a broad interval. 
Van der Waals’ equation is based on the following: The pressure 


: : 7 T 
satisfies the equation of the gas state, i.e., p =, when the forces 


of attraction between the molecules are neglected. But due to the 
mutual altraction of the molecules, the pressure on the walls of 


a vessel should decrease by some value p’. Thus, p = x — p’. 
Now, taking the finite volume of the molecules into consideration, 
RT , , : 
P=5-,—P’ or (p+ p’)(v—b) = RT. 


41-1409 
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Why does p’ = =? Here, we reason as follows: Let us consider the 


gas volume divided into two parts. One part, then, attracts the 
other. The forces of attraction are proportional to the number of 
molecules in the left-hand part and to the number of molecules in 
the right-hand part. In other words, the forces of attraction are 
proportional to the square of the density, i.e., inversely proportional 
to the square of the volume. 

The forces of attraction between molecules will be discussed in 
more detail in Part III. 


CHAPTER X 


THERMODYNAMIC PROCESSES 


58. Graphical Representation 


If two parameters of state are given for a body, the third may 
be calculated by means of the equation of state. Thus, when one 
parameter (e.g., pressure) is plotted along one axis and a second 
parameter (e.g., volume) is plotted along the other'axis, the state 
of the body is uniquely described by a plotted point. 

To be sure, it has been assumed in the graphical representation 
that the state of the body’is in equilibrium. Only then will the 
values of the parameters of state be the same throughout the volume 
of the system and are we justified in speaking of the temperature, 
pressure, density, etc., of the body (or system) as a whole. 

The question arises: What kind of process can be involved if 
equilibrium states are being considered? The answer consists in the 
following: In a process that proceeds sufficiently slowly, the values 
of the parameters of state throughout the volume are equal. Such 
‘ process may be considered to be a continuous succession of equi- 
librium states. It is reversible and may occur in either direction. 
A. process consisting of a succession of equilibriums may proceed 
from state Z to state 2 and then from state 2 back through all the 
intermediate states to state 7, without producing any changes in 
the surrounding medium. 

_ A reversible process is an idealised process. Every actual process 
is in one way or another irreversible, depending on how far away 
the intermediate states of the process are from equilibrium. 

This becomes clear from the following reasoning. Every establish- 
ment of equilibrium is irreversible. Many simple and familiar exam- 
ples can be cited: the cooling of a body by placing it in a cooler 
surrounding, the “dissipation” of a mechanical deformation, e.g., 
the restoration of a compressed spring to its undeformed position 
upon being released, the spontaneous intermixing of two gases, etc. 
Reversible processes cannot proceed of themselves. They cannot 
be single processes occurring in a closed system. oe, 

An actual process does not consist of a succession of equilibriums. 
Inevitably, such phenomena occur as those enumerated above. Hence, 
when the process is made to proceed in the reverse direction, it will 
never pass through exactly the same states as during the forward 
direction. When a gas is rapidly compressed, the pressure of the gas 
in layers close to the piston will be higher than elsewhere. On the 


41* 
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other hand, during the reverse process—expansion of the gas—thie 
pressure near the piston will be lower than elsewhere. 

Nevertheless, in spite of the fact that reversible processes are an 
idealisation, they are of great interest since in any cases the differ- 
ence between actual and reversible processes is insignificant. It all 
depends on the relaxation time, i.e., the time it takes for equilibrium 
to be established. This time varies within very broad limits—from 
~10-15 sec, the time it takes for the 
pressure to equalise in a homogencous 
gas, to minutes, hours, or even weeks, 
for processes involving heterogeneous 
substances. 

Let us assume that a gas is com- 
pressed and the entire process takes 
one second. The relaxation time is 
an insignificant fraction of a second. 
We may therefore consider the actual 
process as a succession of equilibrium 
states and draw the curve on a graph 

Fig. 75 showing p and v, or on some similar 

diagram. The same holds true for all 

other processes in which the relaxation time is small with respect 
to the duration of the process. 

Fig. 75 shows several curves representing simple processes. The 
coordinates of the graph give the pressure and the volume. In engi- 
neering thermodynamics, other coordinates are used in addition 
to these, but we need not discuss them. The vertical line J in the 
figure represents a process at constant volume. If the point generating 
the curve is moving upwards, the pressure is increasing; if the reverse 
is true the pressure is falling. It is clear that a change in tempera- 
ture occurs during this process that is not “seen” on the diagram. 
The horizontal line 2 represents a process at constant pressure (iso- 
baric process). Moving from left to right signifies expansion, while 
the reverse corresponds to compression. The curve designated by the 
figure 3 corresponds to an expansion accompanied by a drop in pres- 
sure, while curve 4 represents an expansion in spite of the increasing 
pressure. The change of temperature in any process may be calcu- 
lated by means of the equation of state.. 

In most thermodynamic processes, all the parameters of state are 

_ changing simultaneously. Nevertheless, a number of simple but at 
the same time practically important exceptions may be singled out. 

These include the processes mentioned above, namely, at constant 

volume (isochoric) and at constant pressure (isobaric), as well as 

the processes occurring without heat exchange (adiabatic) and at 
constant temperature (isothermal). 
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59. Work and Cycles 


In mechanics, work is usually represented as the product of 
a force and a distance. In thermodynamics, we are usually interested 
in the work of changing the volume of a body. Fig. 76 shows the 
shape of a body in two states. The volume is shown to have changed 
from vy to ve. The total work of changing the volume may be consid- 
ered as the summation of the work expended in displacing the ele- 
ments of area dS by the distance dl. If the applied forces are perpen- 
dicular to the surface, the work of displacing a surface element is 
equal to fdl or, in terms of pres- 
sure, pdS dl. Thus, 


dA=pdv, 
where dv is an element of volume 
change. It is evident that the total 
work is given by the definite inte- 
gral: 
Vo 
A= pdv. 


v. 


4 

In a pressure-volume diagram, the 
work of compression or expansion Fig. 76 
can be represented geometrically. ; 

It is simply the area under the curve (bounded on the left and the 
right by vertical lines through the points representing the initial 
and final values of the volume). 

If the pressure during the process of compression or expansion 
remains unchanged and if, moreover, it is the same at all points 
on the surface, then p may be brought out from under the integral 
Sign and the formula for work becomes 


A= p(v2—V). 


As we have already stated, work may be considered positive or 
negative depending on the convention adopted. We have assumed 
that work is positive when a body does work on the surrounding 
medium, i.ec., work of expansion. Accordingly, work of compression 
is negative. 

If a body is transferred from state 7 to state 2 as a result of some 
process, and then transferred back to its original state via the same 
path, the total work of such a process is naturally equal to zero. 
Here, the work of expansion, performed on external bodies, is equal 
to the work of compression, performed by external bodies on the 
system under consideration. However, the situation is completely 
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different when the “forward” and “return” paths differ. Processes in 
which a body is returned to its original state via a different path 
are called cyclical processes. Fig. 77 shows two such cycles and the 
arrows indicate the direction of the processes. One process proceeds 
in a clockwise direction and the other in a counterclockwise direc- 
tion. A process going from left to right signifies expansion. Thus, 
in the clockwise cycle the work of expansion is greater than the 


“ 


Fig. 77 


work of compression. In this case, work is performed on the sur- 
rounding medium. It is evident that in the counterclockwise cycle 
a certain amount of work is performed on the system under consid- 
eration. In either case, the work performed during a cycle is rep- 
resented by the enclosed area (hatched in the figure). 


60. Processes Involving a Change of Gas State 


Let us consider the relations for the four simplest processes involv- 
ing a change of gas state, whereby, in the main, we shall restrict 
ourselves to gases obeying the equation of the gas state. It will be 
presently seen that knowing the equation of state for a substance 
and applying the first law of thermodynamics, a number of valuable 
conclusions may be drawn regarding the behaviour of the body under 
-various conditions. The first law of thermodynamics for gases will 
be used in the form 


AQ=dU + pdv. 


The Isochoric Process. At constant volume, the fist law of thermo- 
dynamics assumes the form 


AQ = dU. 
Heat exchange occurs between the system under consideration and 


the external medium, but no work is performed on the external 
medium or the system under consideration. Two possibilities exist: 
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either 1) the body absorbs heat from the surrounding medium and 
its internal energy increases or 2) the body gives up heat to the 
medium and its internal energy decreases. 

The quantity of heat required to increase the temperature of a 
body by one degree at constant volume is called the thermal capacity 
at constant volume and is designated by the letter c with subscript v: 

au 
ee ites : 

If the dependence of the internal energy of the gas on the temper- 
ature is known, the thermal capacity cy may be calculated. 

At high temperatures, there is a linear dependence between the 
internal energy of a gas and the temperature, since the thermal 
capacity in this case does not depend on the temperature. 

We are unable here to prove an important theorem. If follows, 
however, from the general laws of thermodynamics that if the 
dependence between p and 7 is linear, then c, cannot depend on 
the volume. Since such a linear dependence exists for gases obeying 
the equation of the gas state and Van der Waals’ equation, then c, 
does not depend on v for gases and the phrase “at v = const” may 
be omitted in the above formula. Thus, 


dU 
Cy =p (for gases). 
If the dependence of c, on temperature is only slight, the internal 
energy of a gas may be represented by the formula 
U=c,T + const. 


For ideal gases, the constant does not depend on the volume and 
may be dropped. For a gas obeying Van der Waals’ equation, the 


constant equals —< . Thus, 


U=c,T for an ideal gas 
and 


U=c,T — ~ for a gas obeying Van der Waals’ equation. 


We see that, in the case of an ideal gas, a change in the volume of 
the gas when the temperature is maintained constant does not involve 
a change in energy (see p. 175). Jf the molecules are drawn together 


. . a . 
with a force per unit area of p’ = = then upon expansion of the 
gas the energy increases by the amount of work done against this 
force, i.e., by 


Sy p’ dv= — <= + const. 
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The Isobaric Process. In this process, all three terms in the equa- 
tion for the first law of thermodynamics are different from zero. The 
system exchanges heat and work with its surroundings without 
a change of pressure occurring in the system. This process usually 
involves absorption of heat by a body from its surroundings; however 
not all the heat serves to raise the internal energy of the body and, 
in part, it is returned to the surroundings in the form of mechanical 
work. We shall not consider other cases. 

It is perfectly evident that the thermal capacity in this process 
will differ from that in the isochoric process considered above. In an 
isobaric process, the heat is used not only for raising the temperature. 
Hence, c, (thermal capacity at constant pressure) must be greater 
than c,. The difference may, in some cases, be calculated. 

Let us divide both sides of the equation for the first law of ther- 
modynamics by an incremental temperature: 


_AQ_ ad, dv 
Ee ap Te ae 
This expression for the thermal capacity is valid for any process, 
including the isobaric process under consideration. For gases, this 
formula may be rewritten as follows: 


dv 
Cp= Cot Pap : 


For an ideal gas, the result obtained is very simple. Since pv =pRT, 
then oa and c, = c, + pR. Thus, the difference between the 
thermal capacities at constant pressure and at constant volume is 
equal to the number of moles of gas multiplied by the universal gas 
constant. Then, for molar thermal capacities, 


Cp—Cy=R. 
Since R = 2 cal/mole deg, 
Cp —Cy=2 cal/mole deg. 


The Isothermal Process. In order to avoid confusion, it should 
first be emphasised that constant temperature in no way signifies 
that no heat exchange occurs between the system and the surround- 
ings. A system may absorb heat from the surroundings but not use - 
it to raise the temperature. Thus, as is well known, the internal 
energy of a body may increase at constant temperature (e.g., melting 
ice). Moreover, for gas processes, there is another possibility (more 
important than the first): A system may return part of the heat 
received from the external surroundings in the form of mechanical 
work. 
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In the case of actual gases, both ways of expending the heat are 
entirely possible in an isothermal process. The heat transferred to 
a gas causes Lhe gas to expand without the temperature being raised, 
whereupon 1) work is performed on the external surroundings and 
2) the potential energy of the interacting molecules is increased. 

In the case of an ideal gas, whose internal energy depends only 
on the temperature and therefore cannot change in an isothermal 
process, the first law of thermodynamics assumes a particularly simple 
form. Since dU = 0, then AQ = AA. Hence, either the system ex- 
pands, absorbing heat from an external source and performing work 
on some object, or, conversely, the system contracts, releasing heat 
and obtaining energy in the form of mechanical work from external 
bodies. An ideal gas transforms energy in an isothermal process. 
It obtains energy from the surroundings in one form and returns all 
of it to the surroundings, bul in another form. 

It is not difficult in the case of an ideal gas to go over from the 
differential form AQ = pdv to the integral form. The work (we 
can just as well say heat since work and heat are equivalent) of 
an isothermal expansion from volume yy to volume ve is 


ation of the gas state, and 


ituti » pressure from the equ § | 
PGeaaie ie er the integral sign since it 


bringing the temperature out from und 
is a constant, we obtain 
ve 
dv vo 
A=pRT \ w= pRT Ine. 
v1 

he work of equal numbers of isothermal 
expansions at different temperatures differ, being nee the eal 
the temperature. Thus, doubling the volume of a mole of som 


°K (rr ture) requires 8.31 x 
gas at a temperature of 300°K (room tempera SP eacoe 
> _ j of work, while at a temperature ) 
ee times as much, i.e., 17,300 joules. 


3,000°K. the work required is ten i 
"An actual isothermal process may be difficult to achieve. In any 
case, in order for the process to be even approximately isothermal, 


the walls of the vessel through which the substance comes in contact 
with the surroundings must be perfectly heat-conducting. Moreover, 
the process must proceed very slowly, so that the heat (or work) 
is able to return to the surroundings in the form of work (or heat) 
instead of accumulating in the system. 

The Adiabatic Process. Adiabatic compression and expansion 
occur in the absence of heat exchange with the surroundings. This 


It should be noted that t 
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may be achieved by providing conditions that are in a sense the 
reverse of those for an isothermal process, i.e., perfect thermal insu- 
lation must be provided and the process must proceed very rapidly, 
so that heat is not able to escape from the system or be transferred 
to the system. In the case of compression, in accordance with the 
first law of thermodynamics, which is now written in the form 


pdv= —dU, 


the mechanical work is converted into internal energy of the body. 
In the case of expansion, on the other hand, the work is performed 
at the expense of a decrease in the internal energy of the system 
under consideration. 

For the three processes considered above, the changes occurring 
in the pressure, volume and temperature were quite apparent, and 
for gases followed directly from the equation of state. In an adiabat- 
ic process, the nature of the change in the parameters of state is not 
apparent, since all three parameters of state change. The simulta- 
neous solution of two equations—the equations of the gas state and 
the first law of thermodynamics—enables us to determine the rela- 
tionships. Since only the principle involved interests us, we shall 
restrict ourselves to an ideal gas in order to simplify the mathemat- 
ical calculations. Using the expression for the thermal capacity 


of a gas at constant volume, TF = oe and replacing the pressure 


ptdv a? 


p by per , we obtain: — paar tcate Assume that in the initial state 
Dv 


the gas parameters are 14, p;, 7; and in the final state v2, po, To. 
Integrating the last equation from the initial to the final point of the 
adiabatic process, we obtain 


2 2 
wR o =f dT ; _ UR es Pe 
Cy v T Cy " ra 
1 


! 


1 


. . ¢ 
Recalling that cp — c, = wR and introducing the designation = i 
we obtain 


sion. (yr 
Ty, \ve , 


[It is seen from this equation that for adiabatic compression the 
temperature increases and for adiabatic expansion falls. Various 
examples can be cited. Thus, a gas is rapidly expanded when we 
desire fo cool it and carbon dioxide gas escaping from a gas tank 
may turn into dry ice due to the tremendous drop in temperature 
of the expanding gas. On the other hand, adiabatic compression 
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may be used, for example, to ignite some substance. The following 
experiment is often demonstrated: A wad of cotton wool soaked 
in ether is placed in a vessel containing air. Rapid compression of 
the air by means of a plunger causes the cotton wool to burst into 
flame. 

Since we wish to represent the gas processes on a pressure-volume 
diagram, it is necessary to convert the above equation of the adiabat- 
ic process into an appropriate form. Substituting for the tempera- 
ture by means of the equation of the gas state, we obtain 

Py} = povd. 

Comparing this equation with the Boyle-Mariotte law for an isother- 
mal process, important differences may be observed in the nature 
of the pressure change for compression or expansion. For an isother- 
mal expansion or compression, the product pv remains unchanged, 
while for an adiabatic process the product pv’ remains unchanged. 
Since y > 1, the adiabatic on the diagram is steeper than the iso- 
thermal. When the volume is reduced to one-half in an isothermal 
process the pressure doubles, while in an adiabatic process the 
increase is greater. For example, in the case of most diatomic gases, 
for which y = 1.4, when the volume is reduced to one-half, the 
pressure increases to 2.63 times its original value. 

It has already been emphasised that both processes are of an ideal 
character and that the requirements for the creation of the ideal 
Conditions of these processes are opposite. Therefore, it is evident 
that gas processes under actual conditions yield intermediary 
curves between the adiabatic and isothermal curves. 

The difference between the adiabatic and isothermal curves may 
be easily visualised as follows: For adiabatic compression, the gas 
becomes heated, so that for one and the same reduction in volume 
the increase in pressure is greater in the adiabatic process, since 
heating at constant volume leads to an increase in temperature. 

Fig. 78 shows that the work of isothermal expansion is greater 
than the work of adiabatic expansion. On the other hand, the work 
of isothermal compression is less than the work of adiabatic compres- 
sion. We are assuming, naturally, that the initial points of the proc- 
esses coincide. : 

The work of an adiabatic process may be determined graphically 
or by means of formulas. From the first law of thermodynamics, it 
follows that in adiabatic processes the work must equal the change 
in internal energy: ; 
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In the case of ideal gases, the difference in energy is calculated sim- 
ply as U; — Uz =, (T; — T2). Thus, for ideal gases, the work 
may be calculated by means of this formula. 

Another method may be used to determine the work in an adiabat- 
ic process. Since the equation pyvY = pv¥ holds for any intermediate 
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Fig. 78 


point of the process, where the symbols without subscripts designate 
the current values of pressure and volume respectively, the work 
integral may be written in the form 


v2 
A= py} \ & ; 
v 
vy 
Upon integrating from the initial to the final point of the process, 
we obtain 


Yr 4 4 
— Pwr <7 -=—7) . 
aes (3 vy—! 


Naturally, this formula is equivalent to A = c¢, (7; — PT), which 
is easily demonstrated by using the equation of state of an ideal gas 


4 
and converting the above formula (taking ya! out of the brackets) 
to the form : 


aa (el | 


Depending on the given data, one formula may be more conveniently 
used than the other. : 


Let us illustrate by a simple numerical example the statement made in 
Chapter IX to the effect that the increments AQ and AA (note: AQ ~ AA) are 
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not total differentials, i.e., they do not characterise the change of state of a 
system, 
Assume state (1) of a mole of hydrogen (Fig. 79) is characterised by the 


following data: v,= 20 litres, 7, = 300°K and py An, 30 cal/litre 
mn 
(here R = 2 cal/deg). Now, ep — cy = R and, since hydrogen is a diatomic 


c 
gas, —P 1.4. Hence cy = 7 cal/deg and 
ce 


/ 2 


v 
Cy = 5 cal/deg. F 
We shall now consider three possible 
paths by means of which the gas can change 
to state (3), where v3 = 40 litres, 
73 = 300°K and p3; = 15 cal/litre. 
Path 1-8. The work along the isotherm 


A,_3;= RT, ln <3 = 416 cal. These 416 


calories are Ahsarbed from the hot body, 
ane the internal energy = const since 

"1 = Ts. 

' Path. 1-2-3. Here, (1-2) is an isobar. 
Hence, 7’. = 600°K. The heat absorbed 
from the hot body is Qi» = ep (T2— 11) = 
= 2,100 cal and the work against the Fig. 79 
external forces is Ay—-9 = Py (v2 — 4) = 
= 600 cal. Therefore, the internal energy ; 
of the gas increases by AU = 2,100 — 600 = 1,500 cal. Process 2-3 consti- 
tutes isochoric cooling and Qo-3 = ¢p (Tz — 13) = 1,500 calories of heat are 
transmitted to the cold body. Since v, = v3, no mechanical work is per- 
formed. ; ; 

Thus, along the path 1-2-3, the hot body gave up 2,100 cal, 600 cal of work 
was performed and the cold body absorbed 1,500 cal. Along the path 1-3, the 
hot body gave up 416 cal, 416 cal of work was performed and no change in 
the state of the cold body occurred. However, the change in the state of the gas 
was the same for both cases. 


Path 1-4-3. Were (1-4) represents an adiabatic process, whille (#-3) 
4 


— { t 
n y Tage yas ~ 
is an isobar. Now, = Ga) =27, so that v,=20xX2" litres. From 
4 P 
{ 
wis he yy \v-! cae the 
the relation zt = ( ) , we find that 7,=300x2 . Along the 
1 


Lan 
path 1-4, the work against the external forces is performed only at the expense 
1 


of a decrease in the internal energy: Ay_,=—¢y (7,—7)=1,500 (1—2 ¥ veal: 


—-1 
Along the path 4-3, the hot body gives up Q,_-3=¢p (73—T,) =2,100 (1 —2Y ) 
cal of heat and the work against the external forces equals 44-3 = Pp (v3—v,) = 

4 


—-1 
=G00(1—27 ) cal. 
Therefore, along the path 4-3, the internal energy increases just by 
{ 


—-1! 
4,500(1—2Y. ) cal. Thus, the path 1-4-3 has also not led to a change 
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in the internal energy of the gas, which is uniqucly determined by the 
temperature. 


Measuring the Thermal Capacity of Gases. It would appear that 
the easiest way to determine the thermal capacity of a gas is to fill 
a container with the gas to be measured and immerse it in a calorim- 
eter. However, this does not take into account the fact that the 
thermal capacity of a gas is very small with respect to the thermal 
capacity of a container, no matter what solid material is used to 
make the container. Therefore, the thermal capacity of a gas is not 
measured at constant volume, but rather at constant pressure. For 
this purpose, gas under constant pressure moves in a coiled pipe 
that passes through the calorimeter. By means of a thermocouple, 
the temperature of the gas is measured at the input and output of 
the calorimeter. After preliminary heating, the gas enters the calo- 
rimeter and transfers part of its heat to the water. Knowing the 
quantity of gas passing through the container during a particular 
period of time, and the quantity of heat transferred to the water of 
the calorimeter during the same interval of time, the thermal capac- 
ity of the gas at constant pressure, cp, can be easily determined. 
This is done by dividing the quantity of heat by the mass of flowing 
gas times the difference in gas temperature between input and out- 


put. 
To determine the thermal capacity at constant volume, we use 
: : c 
the ratio of thermal capacities, i.e., Poisson’s coefficient: y = = : 
v 


Many methods of determining y have been proposed, some of them 
being based on the measurement of the volume and pressure of the 
gas in a succession of states for an adiabatic process. Other relation- 
ships between the thermal capacities may also be used, e.g., the 
relationship defining the difference between the thermal capacities 
Cp and cy. 

The thermal capacities of various gases are given in the table- 


c c 
uae cal/deg mole cal /dee mole Y 
Helium He ...... 2.98 5.00 1.67 
Hydrogen Hp .... - 4.87 6.87 1.41 
Nitrogen Ng...-.--- 4.86 6.84 1.44 
Oxygen Op .---+-- 4.99 6.90 1.40 
Water vapour H,0.. . 6.65 8.65 4.31 
Methane CH, ....- 6.51 8.51 4.30 
Ethyl alcohol C,;H;0H 18.9 20.9 4.41 
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This is the process in which gas is allowed to flow through a small 
opening from a region of high pressure p, into a region of low pres- 
sure po. The vessel in which the process takes place is thermally 
insulated from the surroundings. 

In accordance with the conditions of the process, p; and ps must 
not change. This is done by having both pistons (Fig. 80) move to the 
right, corresponding to the pas- 
sage of gas into the region of 
low pressure. 

M, the mass of gas that is 
moved from left to right, does not 
Maintain constant volume, but 
changes from 1, to ve, for it has 
entered a region of different pres- 
sure. This transition is accom- 
plished under the action of the 
left piston and the counteraction 
of the right one. The left piston 
does work at the constant pres- 
sure p. This work is equal to p Av, where Av is the change in the 
volume of the gas to the left of the partition. But v, is the change 
in volume on the left, so that the work done by the left piston is 
Piv,. The right piston does negative work, which in this case is equal 
to the product of the pressure p2 and the incremental volume Ue. 
Thus, when the mass of gas MW is transferred from the left region to. 
the right one, the work performed is piv1 — P2V2. The law of con- 
servation of energy requires that the internal energy of the gas. 
change by this same amount. Therefore, 


U2 — Uy = pyr — Provo. 


This formula is valid for any mass of gas. This means that in the 
process of moving a gas from one vessel into another the quantity 


U + pv =const 


‘(called the heat function or enthalpy) does not change. 

For an ideal gas, U and pv both depend only on the temperature. 
Thus, during a Joule-Thomson process, the temperature of an ideal 
gas does not change. 

For actual gases, the situation is different. If the gas is not ideal, 
the temperature may increase or decrease during a Joule-Thomson: 
process, depending on the nature of the interacting forces between 
the molecules. 
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It is noteworthy that a particular gas may behave differently at 
different temperatures. At a high temperature, the temperature 
increases during a Joule-Thomson process, while at a low Lemper- 
ature, it decreases. The point of inversion corresponds to the temper- 
ature at which the change in sign occurs. This temperature for oxy- 
gen and nitrogen is above room temperature. Therefore, the air is 
cooled during a Joule-Thomson process conducted at room temper- 
ature—not to speak of lower temperatures. For hydrogen, the inver- 
sion temperature is very low. The Joule-Thomson effect below the 
inversion temperature finds application industrially in the lique- 
faction of gases. 


CHAPTER XI 
ENTROPY 


62. The Principle of Entropy Existence 


In the middle of the last century, an important discovery was 
made regarding reversible thermodynamic processes. It was found 
that side by side with internal energy a body has yet another remark- 
able function of state, namely, entropy. Just as in the case of inter- 
nal energy, entropy includes an arbitrary constant. Experiments 
yield the incremental difference in entropy. If a body or system 
absorbs the heat AQ during an infinitesimally small transition from 


One state to another at a temperature 7, the ratio Se is a total differ- 


ential of some function S. This function is the entropy and is thus 
determined by the two following equivalent equations: 
2 
ds ="8 and S,—S,= \ 2. 
{ 
; : , sete fan NOI 
The statement that a function exists whose differential is = is 
known as the principle of entropy existence. It is one of the most 
important laws of nature and an essential part of the second law of 
thermodynamics, which will be discussed below. The discovery of 
this principle, as well as the entire second law of thermodynamics, 
is primarily associated with the names of ‘Carnot and Clausius. 
In spite of its somewhat abstract nature, the essence of the principle 
is easily understood and may be summarised as follows: A body may 
change from one state to another in an infinite number of ways (re- 
presented on a diagram by the various curves connecting the same 
initial and final points); and, although the body may absorb various 


one : A "1: 
amounts of heat during such transitions, the integral (= will in 


1 
all cases have the same value. Ae , the ratio of the quantity of heat 
to the temperature at which this heat was absorbed, is sometimes 
called the reduced heat. Since an integral may always be approxi- 
mately represented as a summation, the change of entropy in trans- 
ferring from one state to another is equal to the summation of the 


reduced heats. Let us assume that the body absorbs a calorie per 
12-1409 
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degree as it is uniformly heated from 20°C to 25°C. The increase 
in entropy is then 
1 cal 1 cal 1 cal 1 cal 1 cal 
S.— 51 = 593.5° + 294.53 ' 295.5° T 396.55 + 597.59 . 


For isothermal processes, the expression for the change in entropy 
is very simple: 


S.—S,=4, 
where Q is the heat absorbed during the process. Thus, when 1 kg 
of ice melts the entropy of the substance increases by ae bos 
= 0.29 ae , 


In applying the concept of entropy, the value of entropy at any 
state (e.g., boiling water or melting ice) may be adopted as zero 
entropy. However, in some cases, the value of the entropy of a sub- 
stance at absolute zero is adopted as zero entropy. There is, inciden- 
tally, some theoretical basis for this (Nernst’s theorem), but we 
shall not go into it. 

Assuming S = 0 at T = 0, the entropy of a substance at the 
temperature T may be determined by the formula 


if the heating occurs at constant pressure. As can be seen, the depend- 
ence of the thermal capacity on the temperature must be known in 
order to determine the entropy. 

The entropy may be easily calculated (except for the arbitrary 
constant) if the equation of state of the substance is known. By defi- 
nition, dS = 42 | Substituting the value for AQ obtained from the 
equation for the first law of thermodynamics, we obtain 


dT d 
dS = y= + ps 7 


By means of the equation of the gas state, we can eliminate the 


: : ee dT d , 
pressure from this equation, obtaining: dS =c,- + wR — . Taking 


the indefinite integral, we obtain an expression for the entropy that 
includes an arbitrary constant: 


S=cyln7+pRlnv + const. 
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It is also possible to take the definite integral of dS, where the 
limits are two states. We then obtain the following’ expression for: 
the entropy difference between the two states: 

_ T2 V2 * 
S,—S,=coln + wR In . 
This is the expression for the entropy of ideal gases. It is seen from 
the formula that the entropy increases as the temperature and the 
volume of the gas increase. Naturally, this is in agreement with the 
general statement that the entropy increases when heat is trans- 
ferred to a body. 


Example: Using the example on p. 173 (Fig. 79), we shal] show that the 
entropy is indeed a function of the state of a system: 
Path 1-2-8. The change of entropy 


: To v2 _ I= g —cal 
S2—S, = ey In + Bln aoe popoutit Ine Tale deg * 
a G5 ] i__s In p__cal__ The total 
The change of entropy S3—Sg,=9 In >= “mole deg ° 
. cal 
change of entropy along path 1-2-3 is S3—5S,=2In2—— a deg . 
Sy eh ee ee 
Path 1-3, 83 Sy=2ln2— deg’ 
Path 1-4-3, Since (1-4) is adiabatic, S,—S,=0. 
i 1-7 l 
Fiat § v Sa Detar i con en NOt 
S3—Sy=epin 724+ 2in Te = 5 In2 +21n2 =2|n2—, deg ° 


It is seen, indeed, that no matter how the transition of the gas from state (1) 
to state (3) is effected the change of entropy is the same. 


63. The Principle of Increasing Entropy 


As already stated, reversible processes, strictly speaking, do not 
exist. However, many processes occur that do not, practically, 
differ from reversible ones. But there are some processes that are 
always unidirectional and as a result can never be made reversible. 
Thus, gas may expand of itself, but it cannot be compressed without 
the application of an external force. Heat may spontaneously pass 
from a hot body to a colder one, but it can pass from a cold body to 
a hotter one only if work (e.g., electric energy) is expended. In the 
presence of friction, the kinetic. energy of macroscopic motion is 
always converted into internal energy, but the reverse process never 
occurs spontaneously. All other irreversible processes are in the 
final analysis based on the fact that in each of them, to one degree 


12* 
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or another, one of the enumerated unidirectional processes occurs. 
In actual processes, it, is impossible to avoid spontaneous expansion, 
friction and thermal dissipation. 

Do not all the enumerated unidirectional processes have a common 
characteristic? As a matter of fact they do: in all unidirectional 
processes, the entropy increases. 

In the case of heat exchange between two bodies, the overall 


change in entropy of the entire system is S. — S=24@, 


{ 2 
where Q, is the heat absorbed by the colder body and Q, is the heat 
given up by the hotter body. 

If T, is greater than 7,, then Q; = — Q,. > 0, Since heat trans- 
ferred to a body is considered positive. Hence, S, — S, = 
= 0, (7-7) > 0, i.e., during heat exchange, there is an increase 
in the overall entropy of the system in which the heat exchange 
occurs. 

Let us take another case. Assume intensive mechanical motion 
{e.g., rotation of a wheel) takes place in a vessel containing gas. 
The volume does not change, but the temperature increases and, 
hence, the entropy changes by S, — S,; =c, ln 2 , le., it also 
increases. ‘ 

Finally, upon expansion into an evacuated vessel at constant 
temperature, the increase in entropy, Sz — S, = pR n=, is again 

1 
positive. Thus, in all unidirectional processes, the entropy of the 
system increases. 

It is easily seen that this conclusion regarding all irreversible 
processes is of great importance. Since each irreversible process is 
accompanied by unidirectional effects serving to increase the entro- 
py, the increase in entropy in an irreversible process is greater than 
the increase that would have occurred if the process were reversible. 
Let _AQ be the heat absorbed by a body at temperature T in an irre- | 
versible process. If the process were reversible, the increase in entro- 


py would equal ae In an actual process, however, the increase in 


entropy is greater than this value: 
AQ 
dS > a 


If the system is thermally insulated, then AQ = 0 and the above 
expression assumes the form dS > 0, i.e., in a thermally insulated 
system, only processes serving to increase the entropy are possible. 

It is quite clear that entropy and internal energy are the most 
important functions determining a thermodynamic process. Thus, 
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if entropy is analogous to the manager of a process, internal energy 
is analogous to the bookkeeper. While entropy determines the direc- 
tion of flow of the process, the energy “meets the expenditures” of 
conducting it. 

If in the above formula the symbol >. is used instead of the sym- 
bol >, the law of entropy for reversible and irreversible processes 


may be described by the following simple formula: dS >4e . This 


formula expresses the essence of the second law of thermodynamics. 
For closed systems, the second law states that the entropy of a ther- 
mally isolated system increases or remains the same. 

Both laws of thermodynamics may be combined in the single 


formula dS ie , which is applicable to all practical ther- 
modynamic problems. 


Examples. 1. In the example on p. 68, we considered the nonelastic colli- 
sion of a bullet with a ballistic pendulum and showed that, upon collision, 
399.6 kg-m of mechanical energy are dissipated in the bullet-pendulum system. 
This means that the bullet irreversibly transfers AQ = 935 cal to the pendulum 
through heat conduction. If it is assumed that the process is isothermal (i.e., 
the thermal conductivity of the pendulum is extraordinarily high), and that 
the temperature is, say, 27°C, then the entropy of the system in this irre 
versible process will increase by . 


AS = AG = 3.12 cal/deg. 


2. A rubber ball weighing 0.3 kg rises 1 metre off the floor after being dropped 
from a height of 2 metres. In this isothermal process (assume t = 27°C), 
we transfer AQ = 0.3 kg-m irreversibly, i.e., the entropy of the ball-floor 
system increases by 


AS = 2.35 X 1073 cal /deg. 


If the ball and floor were absolutely elastic, the entropy would not have 
changed (AS = 0) and the motion of the ball would have continued eternally. 

3. Let us consider the irreversible process involved in the transfer of heat 
from a steam boiler to a condenser. Assume that the steam boiler is at a tem- 
perature 2; = 300°C and the condenser at a temperature tg = 30°C. For a boiler 
thermal capacity of 10,000 kW and an efficiency of 25 per cent, 7.5 X 10° joules 
will be transferred from the boiler to the condenser every second. Since 
the boiler loses heat, its AQ will be negative, i.e., the boiler loses entropy. 
For the condenser, on the other hand, the entropy increases. However, since 
T,| >To, the entropy of the boiler-condenser system will increase each second by 

1 


“AS=AQ (F--7) = 2.81 x 108 cal/deg. 


64. The Principle of Operation of a Heat Engine 


A heat engine converts heat into work. In other words, it takes 
heat from some bodies and transfers it to others in the form of mechan- 
ical work. In order to accomplish this conversion, we must have 


182 Entropy 


at our disposal two bodies at different temperatures, between which 
heat exchange is possible. The hotter body will be designated as the 
hot body and the colder one as the cold body. In the presence of two 
such bodies, the process of conversion of heat into work may be 
described as follows: A substance capable of expanding (the working 
substance) is brought into contact with the hot body. Heat Q, is 
taken from the hot body and is expended on the work of expansion, 
A,,° which is transmitted to surrounding bodies. The working sub- 
stance is then brought into contact with the cold body and transfers 
heat Q» to it at the expense of the work A, performed on the working 
substance by the external forces. : 

To obtain a continuously operating heat engine, the compression 
process must be concluded where the expansion process began. In 
other words, the overall process must be cyclic. The working sub- 
stance returns to its initial state at the end of each cycle. Hence, 
the law of conservation of energy requires that the energy obtained 
from the surrounding bodies equal the energy transferred to the sur- 
rounding bodies. The working substance obtained the heat Q, dur- 
ing expansion and the work A, during compression. On the other 
hand, it gave up the work A, during expansion and the heat Q> 
during compression. Hence, Q; + Ap = Q.-+ A; or Ay— Az = 
= Q; — Q2. When the cyclic process is conducted clockwise, the 
work of compression is less than the work of expansion. Therefore, 
the last equation expresses the simple fact that the network trans- 
mitted to a working substance by an external medium is equal to 
the difference in the heat absorbed from a hot body and given up 
to a cold body. Accordingly, the efficiency of the cycle and, hence, 
of the engine as a whole is ; 


Qo 
a= Qs * 

The described process for the operation of a heat engine is, natu- 
rally, an abstraet scheme. However, the essential features of every 
heat engine are incorporated in this scheme. An expanding and con- 
tracting gas or steam is the working substance, the surrounding 
medium plays the role of cold body, and a steam boiler, or a fuel 
mixture in internal combustion engines, serves as the hot body. 

A refrigerating engine, in which the cycle is reversed, requires 
the same three system components. The principle of operation of 
this engine consists in the following: Expansion of the working sub- 
stance occurs when it is in contact with the cold body. Thus, the 
cold body is cooled even further, which is precisely the task of the 
refrigerating engine. Now, in order to complete the cycle, the work- 
ing substance must be compressed and the heat given up by the 
cold body rejected. This is accomplished when the working sub- 
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stance is in contact with the hot body. Thus, the hot body becomes 
even hotter. The “unnatural” transfer of heat from a cooler body to 
a hotter body is at the “expense” of work. We see, then, that when 
the cycle is conducted counterclockwise, the relationship between the 
energy transferred to a medium and the energy absorbed from 
a medium, i.@., Q1 + Ao => Qs + Ay or Qe aaa! Q1 = (Ay == Aa), 
where as before the subscript 1 refers to the portion of the process 
occurring when in contact with the hotter body, has the following 
meaning: The quantity of heat removed from a system must be 
compensated for by an equal quantity of mechanical work. 

The second law of thermodynamics imposes certain conditions on 
the operation of a heat engine. If a process is assumed Lo be revers- 
ible, the change in entropy of the working substance for the entire 
cycle should equal zero. Stated otherwise, the change in entropy for 
the expansion process must equal (except for reversed sign) the 
change in entropy for the contraction process, i.e., 


SQ e 


In the case of an irreversible process, the entropy of the closed sys- 
tem, consisting of the hot body, the cold body and the working 
substance, increases and, therefore, 


Sat) >o 


(It should be recalled that Q is an algebraic quantity. Thus, heat 
entering the system is considered positive.) By evaluating these 
integrals for specific processes, it is rather simple in a number of 
cases to determine the maximum efficiency of one or another heat 
engine cycle. . 
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We shall now derive the expression for the efficiency of an ideal 
heat engine operating without,losses in a reversible cycle. 

Let us first consider the theoretical four-stroke Carnot cycle rep- 
resented in Fig. 81. The Carnot cycle consists of two isothermals 
(for temperatures 7, and T,) and two adiabatics. Assume that the 
first strokes of the cycle is an isothermal expansion from state J to 
state 2—the working substance is in contact with the hot body 
whose temperature is 7, and the process takes place very slowly. 
When state 2 is reached, contact with the hot body is broken, the 
working substance is thermally isolated and it has the possibility 
of expanding further. Work occurs at the expense of the internal 
energy and the temperature of the working substance is allowed to 
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drop to T,. From this point (state 3), two-stroke contraction begins. 
The working substance comes in contact with the cold body at tem- 
perature 7’, and isothermally contracts to state 4. Here, the working 
substance is again thermally isolated and the contraction continues, 
now adiabatically, with the work- 
ing substance being heated, at 
the expense of performed work, 
to the initial temperature 7. 
The adiabatic processes in a 
Carnot cycle are of an auxiliary 
nature, enabling us to transfer 
from one isothermal to another. 
These processes do not enter into 
the energy balance, since c, (J; — 
— T.), the work of adiabatic 
expansion, and c, (7, — T;), the 
work of compression, cancel each 
other. 
Fig. 84 In an adiabatic process, the ent- 
ropy of a system does not change. 
During isothermal expansion, the entropy of the hot body decreases 


by # and the entropy of the cold body increases by a. The working 


substance returns to its initial state with its entropy unchanged. 


=% 


If the process is reversible, then ++ C1 . For irreversible processes, 


Ty 
the entropy of the entire system, consisting of the cold body, the 
hot body and the working substance, increases, i.e., the entropy 


increment es is greater than the decrement a ; 
2 1 
IQ) LQ) 
een 


Thus, | $2 tlog Ps and, therefore, the efficiency of a Carnot cycle is 


The efficiency of the cycle is determined by the temperatures 
of the cold and hot bodies, respectively. The greater the drop in 
temperature the greater the efficiency of the engine. It is not dif- 
ficult to see that the efficiency of a Carnot cycle is the maximum 
efficiency possible. There is no cycle better than the Carnot cycle 

and, in this sense, it serves as a model for designers of heat engines. 
They strive to make actual cycles approach the cycle of this ideal 
engine. 
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It is not difficult to prove that the efficiency of a Carnot cycle is the opti- 
mum. Fig. 82 shows an arbitrary cycle inscribed in a Carnot cycle. The de- 
Crease in the entropy of the hot body may be repre- 
sented by the integral 


B 
dQ 
\ 
A 
for which the inequality 
B dQ ' B 
>a \ age 


A 


Fig. 82 


is undoubtedly valid, since 7, is the largest value 
assumed by Tf in the integration. The increase ; 
in the entropy of the cold body is . expressed by the integral 


for which the inequality 


e d 4 c Q 
(7 <a Onn, 


is valid, since 7 is the smallest value assumed by 7 in the. integration. For 
a reversible process, 


Therefore, 
| Qe | | Q1 | 
Ll. > T, : 
which yields the condition: 
=< 1 Le 
Nmax * Te 


Thus, the Carnot cycle has the maximum efficiency of all possible 
cycles. 

This maximum efficiency formula shows why steam engines have 
low efficiency. At ZT, = 300° and 7, = 400°, the efficiency is 25 per 
cent. Moreover, this is the maximum efficiency, attained by an 
ideal reversible engine operating without any losses in energy. It 
is, therefore, not surprising that in actua] steam engines the efficien- 
cy is below 10 per cent. Courses in steam engineering discuss means 
used to increase the efficiency. Clearly, the most important method 
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is to increase the temperature of the hot body, i.e., the steam or 
fuel mixture. 


66. The Second Law of Thermodynamics 


As indicated above, the second law of thermodynamics states 
that the entropy in a thermally isolated system increases. This state- 
ment may appear to be somewhat abstract, but it is not the form 
in which this idea was expressed historically. In view of the tremen- 
dous importance of this law of nature, we shall briefly discuss other 
important formulations of the second law of thermodynamics and 
show that they are equivalent to the above. 

Historically, the second law of thermodynamics was expressed 
in the form of Thomson’s postulate on the impossibility of creating 
a perpetual engine of the second kind. A perpetual engine of the first 
kind creates work “out of nothing”, i.e., its work violates the first’ 
law of thermodynamics. A perpetual engine of the second kind 
produces work by means of a periodically operating engine merely 
by absorbing heat from the surrounding medium. If such an engine 
were possible, it would be practically eternal, for the supply of ener- 
gy in the surrounding medium is almost limitless and the cooling, 
say, of the oceans’ waters by one degree would yield an inconceiva- 
bly large amount of energy. The mass of water on the Earth is of 
the order of ~10'8 tons. If this entire mass of water were cooled by 
only 1°, the heat released would be about 10”! kcal, which is equiva- 
lent to the complete combustion of 10'* tons of coal. Rolling-stock 
loaded with this quantity of coal would extend for a distance of 
~10!° km, which is the order of magnitude of the dimensions of the 
solar system! 

A perpetual engine of the second kind is a heat engine working 
with a hot body, but without a cold body. If such an engine were 
possible, it could work on a single stroke. A gas contained in a cyl- 
inder with a piston could indeed expand, but the operation of the 
engine would end there, since for the engine to continue operating, 
the heat absorbed by the gas must be transferred to a cold body. 
Formally, the formula for maximum efficiency shows that a per- 
petual engine of the second kind is impossible. In the absence of 
a temperature drop (7, = T;), the maximum efficiency is equal to 
zero. 

It is impossible to design a periodically operating perpetual engine 
by combining an isothermal expansion with an adiabatic com- 
pression process. Such a process would not be possible even if we 
could make it reversible. For isothermal expansion of the working 
substance, the entropy decreases. Hence, the compression process 
would have to yield an increase in entropy. This, however, is not 
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possible for an adiabatic process, since it proceeds at constant 
entropy. 

The postulate of Clausius also completely corresponds to the 
formulation adopted here for the second law of thermodynamics. 
It states that heat cannot be transferred from a colder body to 
a hotter body without compensation. A process contradicting the 
postulate of Clausius would take place with a decrease in entropy. 
At the very beginning of our discussion, this was shown to be impos- 
sible. 

We shall again return to the second law of thermodynamics in 
Sec. 77, where it will be discussed from the standpoint of the kinetic 
molecular theory. 


CHAPTER XII 
KINETIC THEORY OF GASES 


67. General 


If the molecules of a solid body are assumed to be contiguous, we 
can accurately determine their dimensions by X-ray analysis (p. 385). 
Then by comparing these dimensions with the space available to 
a molecule in a gas, the fundamental properties of the gaseous state 
of matter may be immediately determined. 

The largest linear dimensions of a diatomic molecule of oxygen 


is about 4A. Nitrogen molecules have approximately the same 
dimension, but molecules of hydrogen are considerably smaller. The 
volume of an oxygen molecule is about 10-?% cm’. Since under normal 
conditions there are 2.7 X 10!® molecules in 1 cm® of oxygen, the 
space available to a molecule is about 0.4 x 10-?® cm?. Comparison 
of the volume of a molecule with the space available to it shows how 
little of the space is occupied by molecules. It is-evident that’ for 
such a low density collisions between molecules will be relatively 
rare. On the average, the length of the path traversed by a molecule 


between consecutive collisions is 1,000 A. However, the velocity 
of a molecule is large, about 500 m/sec, so that on the average 
a collision occurs every ten-thousand millionth (10!) of a second. 
In will be shown below how these figures were obtained. 

Molecules begin to draw together only when the distances between 
them become comparable to their own dimensions. Therefore, for 
a large part of their path, molecules move rectilinearly and uniform- 
ly. Only when one molecule comes within range of another does the 
force of interaction become effective. Since the interaction occurs 
over an insignificantly small portion of the path, we can speak of 
a collision between the molecules. The interval of time during which 
molecules perceptibly interact—in other words, the impact time—is 
equal to about 10-™ sec. Thus, a molecule spends by far the greatest 
part of its “life” in free motion subject to inertia. 

This is the situation for gases under normal conditions. An in- 
crease in pressure, leading to an increase in density, may consider- 
ably alter the picture. 

The internal energy of gases in which interaction between mole- 
cules occurs only for the time of instantaneous collision does not 
contain potential energy of interaction between molecules. Such 
gases are called ideal. The use of one and the same term a second 
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time will be shown to be justified by demonstrating the validity of 
the equation of the gas state for such gases. 

Thus, a gaseous substance consists of a tremendous number of 
minute particles that pass through large spaces without colliding, 
then collide like billiard balls and fly apart in different directions 
with different velocities, until the next collision. If we were to trace 
the path of a single gas molecule (naturally, this can be done only 
mentally), we would find it moving now to the left or to the right, 
now forward or backward. Sometimes it would be moving with 
a large velocity and at other times it would be moving slowly. In 
view of the chaotic nature of thermal motion in a gas, the molecules 
of a free gas in thermal equilibrium may be considered to have uni- 
form density distribution throughout its volume. Furthermore, at 
a given instant, there will undoubtedly be equal quantities of mole- 
cules moving in all directions. Other random events will similarly be 
uniformly distributed. For example, at all locations, equal numbers 
of molecules per second of observation will be travelling without 
collision a distance of 100 A to 200 A. 

It must be realised, however, that these statements are of a statis- 
tical character. They are valid on the average, whereby the greater 
the number of gas molecules involved the greater the validity. 

We assert, for example, that the number of molecules moving 
“to the right” is the same as the number moving “to the left”. Natu- 
rally, this does not mean that the numbers are equal to within sev- 
eral units. The number of molecules involved is so large that not 
only is a difference of several units insignificant, but even a differ- 
ence of several million is negligible percentagewise. 

If numerous measurements are taken of the gas density of a given 
volume, the values obtained for the number of molecules will differ 
somewhat from measurement to measurement. From these data, we 
can determine the average value for the number of molecules in the 
volume under consideration. If it were possible to measure within 
an accuracy of even several thousand molecules, the individual 
measurements would oscillate, percentagewise, to an insignificant 
extent about this average value. : 

When it is stated that a number of molecules have such and such 
a velocity, or move in some direction or other, or collide in accord- 
ance with some mechanism or other, then the average value of the 
number is always meant. If the number of gas molecules is large, 
the deviations of the instantaneous values from the average, i.e., 
the fluctuations, are negligible. In a very rarefied gas, however, the 
fluctuations may be considerable. 

It is shown in the theory of probability that, using absolute val- 
ues, the average relative deviation of the gas density from the average 
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is approximately equal to ae , Where x is the number of molecules. 


n 
in a unit volume. Since there are 2.7 < 10!® molecules in 1 cm* 


of gas, the fluctuation of the gas density within one cubic centimetre 
amounts to 


: 4 
V2.7 x 1019 ’ 


i.e., 2 X 10-1° from the average value. It is evident that such devia- 
tions are beyond experimental observation. 

This is how matters stand with respect to all gas properties that 
are determined by the average number of molecules. 

The origin of the kinetic theory of gases dates back to Daniel Ber- 
noulli (1700-1788). M. V. Lomonosov (1711-1765) also made sub- 
stantial contributions to its development. In the 19th century, 
the kinetic theory of gases developed under Clausius (1822-1888), 
Maxwell (1831-1879) and Ludwig Boltzmann (1844-41906) and as- 
sumed its modern form. 


68. Mean Free Path 


The distance traversed by a molecule between two consecutive 
collisions (the range of a molecule) is, naturally, a random quantity 
that may sometimes be very small or very large for individual mole- 
cules. However, in view of the chaotic nature of the particle motion, 
the average value of this quantity for a given gas state is undoubted- 
ly constant. The mean free path or, for brevity, the range 1 is relat- 
ed to the average velocity v of the molecular motion and the average 
time t between two collisions by the simple relation: 1 = vt*. Typ- 
ical values for these quantities were cited on p. 188. 

The range of a molecule depends, in the first place, on the number 
of molecules in a unit volume of gas. Moreover, it is evident that the 
larger the dimensions of the molecule the smaller the mean free path. 

In order to visualise the character of this relationship, let us con- 
sider a cylindrical volume of gas through which a molecule moves 
along the cylinder axis. What is the path taken by the molecule? 

Molecules are not points. They have dimensions determined by 
the distances for which molecular interaction becomes effective. 

On the basis of crystallochemical measurements (see p. 385), we 
may, with considerable accuracy, ascribe a certain form to molecules. 


* Since we are merely concerned with the determination of the connection 
between the physical quantities and not with the determination of the exact 
formulas, we shall not differentiate between average and root-mean-square 
velocities (see below). 
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At distances extending beyond the limits of the molecule’s “bound- 
ary”, the forces of interaction are, practically, not effective. 

Let us project the maximum cross-section of the molecules on to 
the base of the cylinder. Each molecule will be projected differently. 
Since there are many molecules, the average cross-sectional area 
will characterise a molecule with sufficient accuracy. This average 
area of cross-section o is called the effective cross-section. 

A collision will surely occur along the length of the cylinder if 
the area of the cylinder base is completely filled with the cross- 
sections of the molecules. If the cylinder base is equal to 1 cm?, 
cylinder length equal to J, and the number of molecules per unit 
volume equal to m, then there will be a total of nl molecules in the 
cylinder. The projections of the cross-sections of these molecules 
will completely cover the cylinder base when nlo = 1. Under these 


conditions, the value of 2 will have an order of magnitude that is 
‘ 1 p 

close to the average range of the molecule, i.e., l= aae More rigorous 

calculations confirm the validity of this rough estimation. In the 


exact formula, the factor |/2 enters in the denominator: 
1 


LS 5 
V 2x0 


where o has a constant magnitude for a given gas. Thus, the mean 
free path is determined only by the density. A decrease in density 
by a factor of 100, for example, results in an increase in the mean 
free path by the same factor. , 

For air under normal conditions, the effective cross-section o is 
approximately equal to 5 x 107% cm®. This is in excellent agree- 
ment with the dimensions of oxygen and nitrogen molecules obtained 
from crystal measurements. The maximum dimension is equal to 
4.3 A and the minimum is a little less, namely,,3 A. The radius of 
a circle having an area of 5 x 107 cm® is 4 A. 

We can determine the dimensions of molecules by studying crys- 
tals. However, the investigation of particle collisions may be viewed 
as a method of establishing the effective cross-section of particles. 
This method is of value in studying atomic nuclei (p. 554). 


The mean free path under normal conditions is: in air—600 A, in nitro- 
gen—600 A, in hydrogen—1,100 A and in helium—1,800 A. 
69. Gas Pressure. Root-Mean-Square Velocity of Molecules 


Let us consider the problem of using the’ simplified concepts 
regarding the motion and interaction of gas molecules to express the 
gas pressure in terms of the quantities characterising the molecule. 
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Assume that we have a gas enclosed in a spherical tank of radius 
R and volume v. Disregarding collisions between gas molecules, we 
may adopt the following simple scheme for the motion of each mole-' 
cule: A,molecule moves rectilinearly and uniformly with some veloc- 
ity v, strikes the wall of the vessel 
and rebounds at an angle equal to 
the angle of incidence (Fig. 83). Tra- 
versing chords of equal length, 2R 
sin 0, time after time, the molecule 


strikes the wall of the vessel SRT 


times per second. For each impact, 
the momentum of the molecule 
changes by 2 mv sin 0 (see p. 68). 


The change in momentum per sec- 
mee 
ond is equal to 7° 

We see that the angle: of inci- 

; dence cancels out. If the molecule 

Fig. 83 strikes the wall at an acute angle, 

the impacts will occur often, but 

will be weak. If the angle of incidence is close to 90°, the molecule 

will strike the wall less often, but will make up for it by stronger 
impacts. 

The change in momentum for each impact of the molecule on the 
wall contributes to the overall force of the gas pressure. It may be 
assumed in accordance with the fundamental law of mechanics that 
the force of the pressure is simply the change occurring in the momen- 


tum of all the molecules in one second: mut = + ...or, factoring 


out the constants, 


J (vitvit...). 


Assuming 7 molecules are contained in the gas, we may introduce 
the concept of the average of the velocity squared of a molecule, 
which is determined by the formula 


v= Lwi+ vit ites 


The expression for the force of the pressure may now briefly be 
written as follows: 


mnv2 


F=-5 
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Dividing this expression by 4 R*, the surface area of a sphere, we 
oblain the gas pressure: 


nmv2 
~~ 473 * 


Replacing 41" by 3V, the following interesting formula is obtained: 


= 2 
pV =t ane? or pV =n (=). 

Thus, the gas pressure is proportional to the number of gas mol- 
ecules and lo the average value of the kinetic energy associated with 
the translatory motion of a gas molecule. 

A very important conclusion may be drawn by comparing the 
oblained equation with the equation of the gas state. Comparison of 
the right-hand members of the equations shows that 


2 mez \ mv2 
PRE = =n (oe ) dee Le T, 


i.e., the average kinetic energy of molecular translation depends 
only on the absolhute temperature and, moreover, is directly pro- 
portional to it. 

This conclusion shows that gases obeying the equation of the gas 
state are ideal in the sense that they approximate the ideal model 
of a group of particles whose interaction is insignificant. Further, 
it shows thal the concept of absolute temperature, introduced empir- 
ically as a quantity proportional to the pressure of a rarefied gas, 
has a simple kinetic-molecular interpretation. The absolute temper- 
alure is proportional to the kinetic energy of molecular translation. 


The ratio me = N is known as Avogadro’s number. It is the number 
of molecules in one gram molecule and is a universal constant: 
N = 6.02 x 10*8. The reciprocal quantity + is equal to the mass of 
a hydrogen atom: 
My = += 1.66 x 10°-*4 gm. 
Another universal constant is the quantity 
k= HE =F 1.38 x 107 erg/deg, 


which is called Boltzmann's constant. Thus 


mee 3 
SS PAT. 


{38—14a0 
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If the velocity squared, v*, is represented by the sum of the squares 
of its components, v? = vi + v, + vi, it is evident that the average 
energy for each component is 


1 ° 
x AT. 


This quantity may be described as the energy associated with one 
degree of freedom. 

The universal gas constant is accurately known [rom experiments 
with gases. The determination of Avogadro’s number and Boltz- 
mann’s constant, which are expressed in terms of each other, is a 
relatively difficult task involving delicate measurements. 

These results put at our disposal useful formulas for calculating 
the average molecular velocity and the number of molecules in 
a unit volume. 

Thus, for the average of the velocity squared, we obtain 


52 BAT _ BRT 


mN” M ’ 


where M is the molecular weight. The square root of the average of 
the velocity squared is called the root-mean-square velocity: 


bey? Or Urms = 

i.e., the r-m-s velocity is directly proportional to the square root of 
the temperature and inversely proportional to the square root of 
the molecular weight. It is easily determined that at room tempera- 
ture oxygen molecules have a velocity of 480 m/sec and hydrogen 
molecules—1,900 m/sec. At the temperature of liquid helium, these 
molecules would have, respectively, velocities of 40 m/sec and 
160 m/sec, while at the temperature of the surface of the Sun, namely 
6,000°, these velocities would be 2,160 m/sec and 8,640 m/sec, 
respectively. These examples are unrealistic, however, for, at the 
temperature of liquid helium, oxygen and hydrogen solidify and no 
translatory motion of the molecules will occur, while at the temper- 
ature of the surface of the Sun the molecules disassociate into atoms. 

We obtain the following simple expression for the number of 
molecules in a unit volume: 


Avogadro's law follows from this and may be stated as follows: 
For equal pressures and temperatures, all gases contain the same 
number of molecules per unit volume. Thus, under normal conditi- 
ons (at a pressure of 1 atm and a temperature of 0°C), 2.683 x 10" 
molecules (Loschmidt’s number) are contained in 41 cm’*. 
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70. Internal Energy of a Gas 


The properties of monatomic gases are determined by the kinetic 
energy of translation of the molecules. An atom’s internal energy 
does not affect the thermodynamics of the gas. Evidently, the inter- 
nal energy need be considered only when the temperature of the gas 
is very high and collisions between atoms may lead to their excita- 
tion and ionisation. These processes will be discussed in detail 
later on. 

Thus, the following formula for the internal energy of a monatom- 
ic gas will have very broad application: 


mv2 
U=N%, 
2 


where N is the number of molecules. Using the formulas of the pre- 
vious article, we obtain for 4 mole of an ideal monatomic gas the 
expression 


3 
U=ZRT. 


Hence, for the thermal capacity of 1 mole of a monatomic gas, we 
obtain by means of the formulas of Sec.' 60: 


=k 
and 
o=F R. 


The direct proportionality between the temperature and the 
internal energy, and hence the constancy of the thermal capacities 
of a monatomic gas, are valid for quite a broad interval of external 
conditions. 

For polyatomic gases, such a simple picture is valid for a signif- 
icantly narrower interval of temperatures, if valid at all. The reason 
for this is that the energy of a polyatomic molecule consists of the 
energy of translation, the energy of rotation and the energy of vibra- 
tion of the molecule’s components (i. e., the molecule’s atoms) with 
respect to each other. Calculation of the average energy per molecule 
becomes quite difficult. It turns out that the energy of a molecule 
is no longer linearly dependent on the temperature and, hence, 
that the thermal capacities are no longer constants independent 
of the magnitude of the temperature. Nevertheless, it is usually pos- 
sible to find a narrow interval of temperatures in which the thermal 
capacities do not depend of the temperature. This occurs for such 
values of temperature at which the average energy of the molecule 
is not yet sufficient for the collisions of the molecule to lead to 


13* 
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a change in its vibratory state. At the same time, this energy is suf- 
ficiently large so that the discrete (quantum) character of the energy 
of rotation is not felt. Jumping ahead and referring the reader lo 
Fig. 266 (p. 606), it may be stated that linear dependence between 
energy and temperature, and constancy of thermal capacity, will 
occur when the quantity 7, descriptive of the order of magnitude 
of the translational energy of the molecule, is considerably greater 
than the distances between rotational energy levels and less than the 
distances between vibrational energy levels. 

If such an interval exists, the energy of a mole of gas and the 
thermal capacities of this quantity of gas are expressed by the fol- 
lowing simple formulas: 


U=3RT, 
Cp = 3R, 
Cp=4R. 


The doubling of the internal energy and c, with respect to a mona- 
tomic gas may be explained in the following manner. A polyatomic 
molecule has six degrees of freedom, while a monatomic molecule 
has three. Since there are twice as many degrees of freedom, there is 
twice as much internal energy. To be sure, there is nothing self-evi- 
dent about this statement. However, we find support for this view- 
point when we consider a gas consisting of diatomic molecules. 
Since a diatomic molecule is a system consisting of two particles, 

it possesses five degrees 
Cy of freedom (see p. 44). If 
the internal energy is 
indeed proportional to the 
number of degrees of 
freedom, then for a gas 
consisting of diatomic mol- 
ecules the following for- 
mulas should be valid: 


5) 5 
U= Rf, =a 


5R 
2 
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Fig. 84 


Experiments show that 
in the temperature range 
in which the thermal capacity remains unchanged these formulas 
are quite applicable. The internal energy of one mole of a diatomic 
gas at aroom temperature of 300°K is 1,500 cal = 6,250 joules. 

A typical dependence curve for thermal capacity over a broad 
interval of temperatures is illustrated in Fig. 84. 
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Numerous events occur that cannot be predicted. These are called 
random events. The height of a young man appearing for military 
service, the number of pedestrians passing a particular crossing 
during certain hours, and the number of winning tickets in a loan 
lottery falling on each series of one hundred bond numbers are all 
examples of random events. The results obtained by observing numer- 
ous events of a single type, for example, measuring the height 
of a large number of young people, counting the number of pedestri- 
ans per minute over a large number of days, or analysing the number 
of winning tickets for a large 
number of loan lotteries, 
may be summarised in the 
form of a so-called distri- 
bution curve. In the case of 
the height of a person, the 
data may be processed in 
the form of numbers indi- 
cating the number of men 
called up for military serv- 
ice whose heights are be- 
tween 170 cm and 171 cm, 
between 171cm and 172 cm, : 
etc. Thus, the probability Random quantity 
of observing a person among 
those called up who has pre- 
cisely a given height (e. g., 
171.34 cm) is practically equal to zero. Therefore, it is better to 
refer only to the number of men called up having a height lying in 
a particular interval. . 

In the case of the analysis of the prize list, the distribution curve 
may be constructed on the basis of the data for the number of series 
of one hundred bonds for which there was not a single winner, for 
which there was one winner, two winners, etc. 

If we construct a graph, plotting the random quantity (e. g., 
height, number of pedestrians or number of winners) along the horizon- 
tal axis and the number of random events (e. g., number of people 
having a height lying in a particular interval, the quantity of cases 
of a given number of winners per one hundred numbers, etc.) along 
the vertical axis, the obtained curve is a distribution curve. An 
example of such a curye is shown in Fig. 85. The curve is drawn 
through the mid-points of the tops of the rectangles. Each rectangle 
has an‘area equal to the number of times a random event occurred 
for the quantity lying in the given interval. 


Number of random events 


Fig. 85 
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The remarkable feature of distribution curves is their reproduci- 
bility. If we construct distribution curves analysing the height of 
young men called up for military service for a number of years, it 
will be seen that the curves are entirely similar. This similarily 
will not be found if we study the height distribution curves construct- 
ed on the basis of a small number of measurements. As we increase 
the number of measurements on which each curve is based, the curves 
for different years will become more and more similar. The same 
holds true for the distribution curves of all events, provided the 
events are random and the conditions of the obtained curves do not 
change. 

We call the distribution law of one or another quantity a statis- 
tical law. It is more accurately given the greater the number of events 
used to determine each ordinate of the curve. 

Naturally, knowing the distribution curve does not enable us to 
predict the number of a bond that will win in the next lottery. How- 
ever, we can say, for example, what portion of the series consisting 
of one hundred numbers each will have one winner. The greater the 
number of bonds used in the analysis, the greater the accuracy of 
this prediction. 

In view of the large number of molecules contained in very small 
volumes of matter, all kinds of statistical predictions about the 
behaviour of molecules are made with particularly high accuracy. 
A distribution curve of one or another random quantity plotted for 
the molecules of a substance will be reproduced with tremendous 
accuracy because each “rectangle” of the distribution curve corre- 
sponds to thousands of millions of molecules. 


72. Boltzmann’s Law 


Certain ideas about the distribution of molecules follow immedi- 
ately from the chaotic nature of thermal motion. This applies to 
the velocity distribution of the molecules according to direction and 
to the volume distribution of the molecules for the case when no 
forces act on the gas. However, there are numerous cases when the 
consequences of the assumption regarding the chaotic nature of 
thermal motion are not evident in advance. 

First, there is the question of the distribution of the molecules 
according to speed. What percentage is moving rapidly, and what 
percentages are moving with average or slow speeds? Then, there 
is the problem of determining how a uniform density distribution 
of molecules changes when the gas is placed in a field of force, say 
in a gravitational field—or, if the molecules have electric or magnetic 
properties, in an electric or magnetic field. Boltzmann’s law, which 
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may be derived by means of the theory of probability, gives the 
answers to these and similar questions. 

Let us consider a small volume of space—a cube at point z, y, 2 
whose sides are Az, Ay, As. Assume a considerable number of mole- 
cules to be contained in this cube. We shall consider those molecules 
having velocity components in the ranges from vy to vy + Avy, vy 
to v, + Av, and v, to v, + Av,. The magnitudes of Av,, Av, and 
Av, are such that a large number of molecules are contained in the 
indicated interval ofj velocities. This is necessary in order to be able 
to apply the laws of statistical physics to these small volumes (phys- 
ically, infinitesimal volumes). In the future, we shall say that such 
molecules have coordinates in the neighbourhood of x, y, z and veloci- 
ties in the neighbourhood of vx, vy, Vz. We repeat, to speak of a quan- 
tity of molecules that have exactly a given velocity is impermissible, 
for the probability of encountering such a molecule is infinitely 
small. Since the kinetic energy of a molecule is determined by 
the value of the velocity and the potential energy of a molecule in an 
external field depends on the coordinates of the molecule in space, 
all the molecules segregated by us have, practically, one and the 
same energy 6. 

Boltzmann’s law, based on considerations developed in courses 
on theoretical physics, gives a general expression for the number of 
molecules whose coordinates are in the neighbourhood of z, y, z 
and velocities in the neighbourhood of vx, vy, vz. This number is 


An = Ae~€/*T AxAyAzAv,Av,Avz, 


where A is a constant that may be determined for a concrete problem, 
T is the absolute temperature and & is Boltzmann's constant. 

The energy in the exponent is equal to the sum of the kinetic 
energy of translation of the molecule and the potential energy of 
the molecule in the external field: 


2 
6=—4+U. Hence, 


= +U 
An=Ae  "T AxAyAzAv,Av,Av,. 

This formula also applies to the case when the molecule possesses 
other forms of energy as well, for example, rotational and vibration- 
al. These components of the energy must then be included in @. 

Boltzmann’s law or, as it is also called, Boltzmann's distribution, 
shows that the largest energy corresponds to the lowest number of 
particles whose velocities and coordinates lie in the given interval. 

We shall apply Boltzmann's law to the solution of important prob- 
lems related to the height distribution of particles and the velocity 
distribution of molecules. 
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73. Distribution of Particles with Respect to Height 
in a Gravitational Field 


If, in a liquid, there are a large number of small particles that are 
heavier than the liquid and do not dissolve in it, al first glance it 
may appear that sooner or later these particles must fall to the bol- 
tom. This, however, does not occur—but it would if there were no 
thermal motion. 

Thus, the force of gravity altracts the particles downwards, but 
the chaotic thermal motion, an inherent property of all particles, 
will continuously impede the action of the gravitational force. 
A particle moving downwards may experience a collision on the way 
that hurls it back upwards. It again begins to move downwards and 
again a collision may hurl the particle upwards or sidewise. While 
some particle may succeed in reaching the botlom of the vessel, 
another particle, on the other hand, may be raised from the bottom 
by random impacts and brought to the upper layers of the liquid 
by random impulses. It is quite understandable that as a result 
some nonuniform distribution of particles is established. In the 
upper layers there will be the least number of particles, while at 
the bottom of the vessel there will be the greatest number. The 
heavier the particles and the lower the temperature, the more 
will the height distribution of particles be “compressed Loward the 
bottom”. “aan” 

The quantitative aspect of this interesting phenomenon, occur- 
ring for all particles located in a gravitational field (molecules of 
a gas or particles of an emulsion suspended in a gas or liquid) be- 
comes clear from Boltzmann’s law: We may rewrite the exponential 
factor in the formula for the Boltzmann distribution in the form 

mr2 mgh 
e Te RT. 


? 


U, the potential energy of gravitation, has been replaced by the 
expression mgh. Now, we are interested in the number of molecules 
(of all velocities) located at a height between hk and kh + Anh. It is 


_ mgh 
An=nge *? Ah. 
Here, the coefficient of proportionality 7» is simply the specific num- 
: A 
ber of particles an 
cles decreases with increasing height. 

The form of this relation confirms the correctness of the assertion 
made above that the greater the mass of the particles and the lower 
the temperature, the more rapidly does the curve fall. It is also 
evident from the curve that its rate of decrease depends on the gravi- 


at h = 0. Fig. 86 shows how the number of parti- 
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tational acceleration. On different planets, the distribution of par- 
ticles with respect to height will differ. 

According to the above formula, at least a small number of 
molecules exists al every height above the Earth’s surface. This 
means that molecules may recede from the Earth and fly into space, 
for it is not excluded that as a result of random collisions one or 
another molecule will attain a velocity of 11.5 km/sec, which is suf- 
ficient, as we know, to escape 
from the Earth’s gravitational 
pull. It may, therefore, be stat- 
ed that the Earth is gradually 
losing its atmosphere. How- 
ever, calculation of the rate of 
dispersion of the atmosphere 
Shows that it is negligible. 
During the entire existence of 
the Earth, an insignificant 
amount of air has been lost. 
The situation is different as ; 
regards the Moon, where the: 
velocity required to overcome Fig. 86 
gravity is ~2 km/sec. Such a 
small velocily is very easily 
attained by molecules and as a result the Moon has no atmosphere. 

The formula giving the number of particles as the function of 
height may be rewritten for the density of a gas or for the pressure 
of a gas. Since the gas pressure is proportional to the number of 
particles in a unit volume, the formula may be written in the form 


An 
4h 


p=poe *F 


Here, pp is the pressure at zero level. This formula is called the 
barometric formula. It is used by meteorologists measuring atmos- 
pheric pressure at high altitudes to reduce the results of their 
measurements to “sea level”. 


It is necessary to note yet another important application of the formula 
for the distribution of particles with respect to height, which was used for the 
experimental determination of Avogadro’s number by Perrin, the French 
scientist. In accordance with the conditions of the experiment, Perrin had to 
somewhat modify the formula for the distribution of molecules with respect to 
height. He studied an emulsion obtained by dissolving gutta gamba (a variety 
of resin) in water. Using a microscope, an entire mound of spherical granules 
could be observed in the emulsion. Perrin used a centrifuge to sort the gutta 
gamba granules according to size. Several months of labour yielded 20-30 gm 
of gutta gamba granules having a diameter of 0.74 microns. The density of 
gutta gamba is D = 1.195 gm/cm%, i.e., the mass of one grain was equal to 
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7 X 10-4 gm. Exact determination of the dimensions of the grains was no casy 
task. Perrin made this determination using three independent methods: 

1) The length of a chain of several dozen contiguous grains was deter- 
mined under a microscope. 

2) The weight of several thousand grains was measured and the dimension 
calculated from the known density of gutta gamba. 

3) Stokes’ formula (see p. 221) was used to determine the dimension from 
observations on the velocity with which a cloud of grains sinks in an emul- 
sion. It was assumed that, in accordance with Archimedes’ principle, a grain 


‘ 3 4 : . 
sinks under the action of the force san (D — d) g, where d is the density 


of the liquid and r is the radius of the grain. When the grain sinks uniformly, 
this force is balanced by the force of viscous friction calculated by Stokes’ 
formula. From this condition, it is easy to determine r. 

There was close agreement between the results of all three methods. This 
signified that the effective weight of a microscopic granule floating in a liquid 
may be written in the form mg (1-+) . Recalling that kat » Wwe obtain 
the following barometric formula for an “atmosphere” of gutta gamba grains 
floating in water: 


= Nmgh po) 

n=nge BF D? 

The experiment reduced to the determination of the ratio of concentrations n 
at equal levels. This was accomplished by focussing the microscope on suffi- 
ciently thin layers of the emulsion and calculating the number of particles 
in the field of vision for equal intervals of time. Perrin changed the viscosity 
of the emulsion by a factor of one hundred and observed that the ratio of con- 
centrations exactly agreed with the barometric formula. Substituting the values 
of no, n, h, m, d, D and T, it was possible to determine NV. It turned out that, 
in spite of the large changes ‘in the viscosity of the emulsion and the dimen- 
sions of the grains, N determined in this manner agreed excellently with the 
values predicted by the kinetic molecular theory. Perrin obtained 6 x 1073 << 
< N <7 X 105, while according to modern data NV = 6.02472 x 1078. This 
was highly reliable evidence that the Boltzmann distribution according to 
energy is applicable even to particles having a gram molecule (mass of N par- 
ticles) equal to 50,000 tons! 


74. Velocity Distribution of Molecules 


The velocity distribution of molecules, first determined theoreti- 
cally by Maxwell, an outstanding English physicist, may be consid- 
ered to be a consequence of Boltzmann’s law. 

According to Boltzmann’s law, the number of molecules whose 
velocities are in the interval from vy, to vy + Avy, vy to vy + Avy, 
and v, to v, + Av, is 

mv2 


An=Ce 2*T Av,Av,Av,z. 


It is implied that we are interested in the velocity distribution in 
a small volume of gas and that the space distribution of the mole- 
cules is taken into account by the constant factor C, which does not 
interest us at the moment. 
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The above formula takes account of the distribution of the mole- 
cules with respect to the magnitudes as well as the directions of the 
velocities. However, we already know the distribution with respect 
to the directions—the number of molecules moving in one or another 
direction must be the same for complete randomness in the molec- 
ular motion. We are interested in the number of molecules having 
a speed from v to v + Av, where 

v=Vvi+u,+v%. 

If we construct a three-dimensional diagram, along whose axes 
Vx, Vy, Vz, the projections of the velocities of the molecules, are 
plotted, and consider this space to be divided into infinitely small 
cubes of volume Av,Av,Av,;, the data on the velocity distribution 
of molecules may be simply represented as the numbers of molecules 
contained in a cube. Boltzmann’s formula gives us the number of 
molecules for each one of the cubes. However, examining the for- 
mula, we see that the number of molecules is the same for all cubes 
located within a spherical shell of radius v to v + Av, for only the 
absolute value of the velocity enters in the exponential factor of the 
formula. The number of molecules having velocities in the range 
from v to v + Avis proportional to the volume of the spherical shell, 
i. e., 4nv*Av. Thus, if the number of molecules contained in one 
cube is equal to 


mre 
Ce 2kT Av,Av,Avz, 


the number of molecules contained in the spherical shell, i. e., 
possessing velocities in the range from v to Av, is represented by the 


formula 
mv2 


* An=Ce 2#T 4nv*Av. 


What then is the nature of this dependence? At v = 0 and v = oo, 
the number of molecules is equal to zero. It is evident that the curve 
must have a maximum. Let us determine in the usual manner the 
Maximum of the factor preceding Av. Taking the derivative of this 
expression and setting it equal to zero, we obtain 


mv2 


£ (e 24T y2)—0. 


Hence, the value of the velocity for which the distribution function 
1as a maximum is 
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What can be said about this velocity? Since the number of mole- 
cules having the velocity v are plotted along the ordinate of the 
distribution curve, c is a peculiar boundary. Molecules moving with 
velocities greater or less than ¢ are encountered less often than mole- 
cules of velocity c. 

This velocity is called the most probable. The velocity distribu- 
tion curve for gas molecules (Maxwell distribution) is shown in 
Fig. 87. 

It is interesting to compare the formulas for the most probable 
velocity and the r-m-s velocity: 


2kT 3kT 
c= ai and Urms = Fae 


We see that the r-m-s velocity is greater than the most probable. 
The reason for this is evident from the form of the distribution curve. 
Since the curve extends far 
to the right, the root-mean- 
square velocity is displaced 
in that direction. 

Let us cile several figures 
characlerising the velocity 
distribution of gas mole- 
cules. The number of mole- 
cules with velocities close to 


aS 
s|s 


C Vins of the most probable velocity 
: c is 1.1 times larger than 
Fig. 87 the number of molecules 


with velocities close to the 
root-mean-square value, 1.9 times larger than the number of mol- 
ecules with velocities close to 0.5c, and 5 times larger than the 
number of molecules with velocities close to 2c (see Fig. 87). 


75. Measurement of the Velocities of Gas Molecules 


Even though the law of molecular velocity distribution is based 
on exceptionally well-founded theoretical grounds, whose validity 
is confirmed by a large number of physical facts, it is interesting to 
subject the distribution formula to direct experimental verification. 

The velocity of gas molecules can be measured in a volume only 
by indirect means. If a molecule radiates light, the velocity of its 
motion affects the width of the spectral lines (Doppler effect). 

Direct means of measurement require molecular beams. For this 
purpose, a long tube of large diameter is partitioned by two shulters 
having very small apertures. The gas is placed in an end compart- 
ment, whereupon the molecules begin, at first, to penetrate into the 
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middle compartment and will sometimes even reach the compart- 
ment at the other end. Clearly, only those molecules whose vector 
velocilies are directed along the axis of the tube when passing through 
the first aperture can traverse the entire length of the tube. Thus, 
a molecular beam is separated out from the gas. The velocities of the 
beam molecules all have the same direction. It is evident, however, 
that due to the random motion of the molecules, the distribution 
with respect to speed will be the same as for the molecules of any 
other direction of motion. 

To measure the velocities of the beam molecules, we can resort to 
an arrangement reminiscent of an apparatus used for measuring the 
velocity of a bullet. Such an apparatus has two cardboard disks 
rigidly fixed on a shaft and rotate about it with a velocity ow. If 
the bullet travels parallel to the axis of rotation, the disks will be 
consecutively pierced at two points displaced in azimuth by an angle 
( with respect to each other. This angle corresponds to the rotation 
of the system while the bullet traversed the distance 7 between the 


disks. The rotation time for angle @ is equal to = . Hence, the velocity 


of the bullet is 
v=l:te= ia 
(0) (Pp 

Since molecules cannot pierce disks, the analogous experiment 
for molecular beams is performed with disks in which slits are cut 
along radii. The angular distance between the slits is equal to 9. 
Clearly, molecules of velocity v can pass through two slitted rotating 
disks only for a specific angular velocity o, satisfying the condition 


2 = £, Thus, by varying o, we can filter the molecules according 


to their velocities, collect molecules having the same velocities and 
measure their relative quantities. 

The velocity distribution formulas discussed above, and hence the 
formulas for the r-m-s and most probable‘values of molecular 
velocities, have been verified by numerous experiments. 


* 76. Probability of a State 


Let us consider a box divided into two equal parts by a partition 
in which an aperture has been cut. If there are molecules of gas in 
the box, they may be transferred from one half of the box to the other 
as the result of random collisions with the walls of the container and 
with each other. 

In spite of the fact that the molecular motion is completely hap- 
hazard, a method exists for predicting how many molecules will be 


206 Kinetic Theory of Gases 


in the left half of the box and how many in the right half. This meth- 
od is based on the application of the theory of probabilily. 

If there were one molecule in the box, the chances or, as we say, 
the probability, that the molecule is in the right-hand portion is 
the same as that it is in Lhe left-hand portion. Since, in all, there are 
two possible cases (the molecule is either in the left-hand or in the 
right-hand portion), and we are interested in the realisation of one 
of these cases, the probability of the molecule being in one half of 


the box is said to be equal to + . Now, assume that there are two mol- 


ecules in Lhe box, designated by the figures 1 and 2. In all, there are 
now four possible dispositions—both molecules on the left, both on 
the right, molecule No. 1 on the left and No. 2 on the right and, 
finally, No. 2 on the left and No. 4 on the right. We are interested 
in the probability of finding two molecules on the left. This is one 


case out of four possible ones, so that the probability is equal to + . 


i.e., (=) - For three molecules, the situation is as follows: 


left 1, 2, 3 0 1,2 1,3 2,3 3 2 1 
right QO 1, 2,3 3 2 1 1,2 1,3 2,3 


Clearly, the probability of all three molecules being on the left is x , 


1s Os (+ - It is not difficult to see that, for the case of N molecules, 


the probability of all the molecules being in one part of the box is 
equal to (> a Whenever another molecule is ‘added, it is always 
possible to place it either in the left-hand or in the right-hand part. 
Therefore, with each newly added molecule, the probability of the 
molecules being in one half of the container is obtained by dividing 
the preceding probability by two. 

When the number of molecules is still no more than one hundred, 


; 1\N., / 
the quantity s) is already so small that we need no longer take 


account of the possibility that all the molecules will be located in 
one half of the container. However, the number of molecules in 
a cubic centimetre of gas is not one hundred, but about 10”. If the 
container is considered to be divided into two parts, the probability 
that the molecules will all turn up in one half of the vessel is equal 
4, 1020 ; ; 
to (=) . By taking the logarithm, this number may be converted 
into the form 10-3". To put this number in decimal form, 
3 x 10 zeros must be written! A person writing at a rapid speed of 
threé zeros per second will require 10!° sec to write this number. 
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This is equivalent to 300,000 million years, which is ten times the 
amount of time our solar system has been in existence. 

Let us return to the table for the disposition of three molecules. 
Only for one disposition out of eight do all the molecules turn up on 
the left. Every other disposition is also encountered one time out of 
eight. It should be remembered, however, that the molecules are 
arbilrarily numbered and there is no way of differentiating a dispo- 
sition in which Nos. 4 and 2 are on the left from one in which 2 and 
3, or 4 and 3, are on the left. Thus, compared to the one disposition 
in which there are three molecules on the left, there are three dispo- 
sitions in which there are two molecules and the same number of 
dispositions in which there is one molecule on the left. Therefore, 
the probability of some characteristic distribution existing, regard- 
less which particular molecules produce it, may be measured by the 
number of dispositions that can produce the distribution. The great- 
er this number, the more frequently will such a distribution be 
encountered, i. e., the more probable will this distribution be. 

The smallest piece of matter consists of a tremendous number of 
molecules. In their haphazard motion, the molecules are endlessly 
changing their location and velocity. However, if the substance is. 
in thermal equilibrium with the surroundings, the measurable 
macroscopic characteristics of the substance (i. e., the pressure, 
temperature or energy, etc.) remain unchanged. One and the same 
state is achieved at every instant in another manner, i. e., the mole- 
cules are located differently (with respect to the adopted numera- 
tion of the molecules) and the velocities are different each time 
(again with respect to the numeration). One and the same state is. 
achieved by means of a tremendous number of dispositions of the 
numbered molecules, but the space and velocity distribution of the 
number of particles remains unchanged. 

The number of dispositions of the molecules leading to one and 
the same space and velocity distribution and, hence, to one and the 
same state of a substance relative to the total number of disposi- 
tions of the molecules is called the probability of the state. 

If there are only several molecules in a volume under considera- 
tion, the state of this system will continuously change. Not only 
will there be probable states, but sometimes less probable states will 
also be encountered. Thus, in the case of three molecules, all the 
molecules will be in one part of the vessel, on the average, in one 
case out of eight. However, as the number of molecules increases, 
the “weight” of the most probable state also increases. The probabil- 
ity of achieving the more probable states becomes immeasurably 
greater than the probabilities of other states. To make this clear, let. 
us return to the example of the molecules in a box, but assume that 
the box is divided into a thousand million cells. What is the proba- 
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bility that N gas molecules are distributed in all the cells except 
one. Using the same reasoning as previously, it is not difficult to 
conclude that this probability is equal to (0.999999999)*. At first 
glance, it may appear that this will yield a value close to unily. 
However, taking NV = 10, the approximate number of gas mole- 
cules in one cubic centimetre, we find that the probability is equal 
to 10-#%2019, 

This calculation shows that the probabilities of the most probable 
states of a substance and the probabilities of improbable states are 
entirely incommensurable quantities. That is why a stale of thermal 
equilibrium is macroscopically stable and the parameters of a sub- 
stance in a state of equilibrium do not change with time. 


77. Irreversible Processes from the Molecular Viewpoint 


The example of the previous article shows quite clearly that the 
state in which the molecules are distributed “uniformly” is the most 
probable. Any deviation from “uniformily”—displacement of one 
portion of the molecules to the left side of the container, disposition 
of the faster molecules on the left, directed motion of a large portion 
of the molecules, in short, any deviation from haphazardness in the 
space and velocity distribution of the molecules—results in a decrease 
in the probability of the state. This conclusion enables us to 
comprehend the kinetic-molecular nature of the irreversibility of 
actual processes. 

As was established above, the second law of thermodynamics 
for irreversible processes, i. e., the law of increasing entropy in 
thermally isolated systems, is a generalisation of experimental expe- 
rience on the impossibility of certain processes. Thus, heat cannot 
be transferred from a cold body to a hot one without compensation, 
a body cannot acquire kinetic energy merely at the expense of a 
decrease in the internal energy of the surrounding medium, and 
a gas may expand of itself but not contract. 

The existence of irreversible processes is a peculiarity of molecu- 
lar phenomena. For a purely mechanical phenomenon, i. e., a proc- 
ess without friction, the process may always be reversed. When 
a pendulum moves to the right, it passes in reverse order through all 
the states passed in moving to the left. A billiard ball rebounding 
from the side of the table in some direction or other will, in turn, 
rebound from an elastic wall placed in its path and retrace in reverse 
order the entire path traversed in the “forward” direction. The com- 
plete equivalence of “forward” and “backward” is evident for purely 
mechanical processes. Why then do molecular processes, which we 
have considered as the totality of the movements of the molecules, 
not possess the property of reversibility. There is only one reason. 
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In all irreversible processes, the probability of the state increases. 
A reversible process is a conceivable process, i. e., its implementation 
is in principle possible, but for the time available to a person for 
observation such a process is, practically speaking, improbable. 

This is not difficult to show for any irreversible process. Heat 
transfers from a hot body to a cold one, but not vice versa. In the 
case of gaseous bodies, such a process may be visualised as a mixing 
of fast molecules with slow ones. The reverse process cannot occur 
according to the random law, for it would constitute a sorting out of 
fast and slow molecules, i. e., a transition to a more orderly state. 

For the same reason, using a shovel, we can quite rapidly mix the 
contents of two sacks of different grain. However, we can continue 
to mix the contents of these two sacks endlessly without the grain 
separating in such a manner that one of the grain varieties appears 
above and the other below. It should be realised, moreover, that the 
number of kernels in the sacks is immeasurably less than the number 
of molecules in a cubic millimetre. 

It is also easily seen that the reverse process of spontaneous 
expansion of a gas is completely improbable. If in the partitioned box 
considered above there is gas on the left and vacuum on the right, 
both parts of the box will be uniformly filled with gas within a short 
time. In principle, it could occur that all the molecules turn up 
together again on the left. However, the probability of such an event 


- 1\N 
is extremely small. Its value has been shown to be equal to (=) ‘ 


No matter which irreversible process we consider, the result will 
always be the same—each irreversible process is accompanied by an 
increase in the probability of the state. 

Thus, there are two quantities that increase during irreversible 
processes—the entropy S, with which we are already familiar, and 
the probability of the state W, which we have just discussed. It 
is natural that these two physical quantities should be related. Boltz- 
mann showed that such a relation does, in fact, exist. The formula 
given by him has the form S = & ln W, i. e., the entropy is propor- 
tional to the logarithm of the probability of the state. 

The second law of thermodynamics thus acquires still another 
formulation: In reversible processes, the probability of the state 
does not change, while in irreversible processes (we are referring to 
closed systems) the probability of the state increases. 


78. Fluctuations. Limits to the Application of the Second Law 


All physical properties remain unchanged if the space and velocity 
distributions of the molecules do not change. In principle, the distri- 
bution of the molecules of a substance may change with time. How- 


14-1409 
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ever, as we have indicated above, the most probable distributions 
stand out so sharply that deviations from these distributions must 
be considered to be very rare events. The physical characteristics 
corresponding to this most probable distribution are called the 
average characteristics. Practically, the deviation of a measured 
physical characteristic from its average value for systems having 
large numbers of molecules is impossible to observe. This is the sit- 
uation when the physical properties are being considered for vol- 
umes containing large numbers of molecules. However, if the number 
of particles in a system is small, it is also possible to observe rarer 
space and velocity distributions of molecules. The values of the 
physical characteristics corresponding to these rarer distributions 
differ from the average values. These deviations of the physical 
characteristics from their average values. which occur in systems 
having a relatively small number of particles, are called fluctuations. 
All properties of volumes containing small numbers of molecules— 
e. g., temperature and pressure, thermal capacity and thermal con- 
ductivity—are subject to fluctuations about the average val- 
ues. 

This question may be approached somewhat differently. If a tiny 
mirror suspended from a thin string is placed in a gaseous medium, 
then, from the macroscopic standpoint, the pressure of the gas acting 
on the mirror cannot manifest itself, for the forces ac¢ton all sides 
equally. In principle, from the molecular standpoint, the changes 
in momentum due to the impacts of the molecules on the mirror do 
not necessarily have to balance for the different portions of its sur- 
face. A light mirror may thus begin to execute fluctuational vibra- 
tions. As was stated above, for any particle (molecule, Brownian 
particle, pea), the energy of thermal, random notion is equal to 


+ kT for one degree of freedom of motion. And this is the energy, on 


the average, falling on the mirror. On the other hand, the work of 
rotating the string by an angle Ag is equal to A/Ag. Therefore, angu- 


: eat kT F 
lar deflections of the order of magnitude Aq@ iar will occur quite 


often. 

Such fluctuations are indeed observed and their measurement may 
be used for the experimental determination of Boltzmann’s constant 
and, hence, of Avogadro’s number. 

Fluctuational effects limit the accuracy of measurements. A point- 
er, mirror, or any other indicating device is subject to fluctuations. 
At room temperature,the accuracy limit in energy units is about 
407° joule. In the construction of many instruments such accuracy 
has not yet been achieved, but in some of the best measuring devices 
jt already has. 


78. Fluctuations. Limits to the.Application of the Second Law 11 


Fluctuations limit the applicability of the second law of thermo- 
dynamics. They represent processes in which the system passes from 
a more probable state to a less probable one, i. e., processes in which 
the entropy decreases. 

This is excellently illustrated by Brownian movement, where the 
pressure fluctuations occur in small volumes and affect individual 
particles. Due to these random pressure vibrations, a particle may, 
for example, be impelled upwards. However, motion against the’ 
force of gravity requires work. In this case, the work is performed at 
the expense of the random, thermal motion of the molecules, i. e., 
only at the expense of the internal energy of the substance, which 
is at complete variance with the second law of thermodynamics. 

Although phenomena for which the entropy decreases occur at 
times in individual small volumes, i.e., the second law of thermody- 
namics is contradicted, the system as a whole will always obey this 
law. Due to the randomness of the events, the number of processes 
occurring at the expense of the internal energy will be exactly equal 
to the number of processes occurring in the reverse direction. It can 
be rigorously proved that any attempt to create a perpetual engine 
of the second kind by “selecting” in individual small volumes proc- 
esses that contradict the second law will end in failure. 

The second law of thermodynamics has a limit at the other end of 
the scale as well. In addition to being inapplicable to systems having 
a very small number of particles, it loses validity for systems having 
an infinitely large number of particles, namely, the universe or any 
of its infinitely large components. As was explained above, the es- 
sence of the second law of thermodynamics consists in the fact that the 
number of equilibrium states is overwhelmingly greater than the 
number of nonequilibrium distributions. However, for the universe, 
which consists of an infinitely large number of particles, this state-_ 
ment loses its meaning, for the number of equilibrium states and 
the number of nonequilibrium states are both infinitely large. Con- 
sequently, for the universe as a whole, one cannot speak of the differ- 
ences in the probability of states. , 

This is confirmed by general philosophical and physical considera- 
tions first advanced by Engels and Boltzmann, respectively. As 
a matter of fact, if we assume that the second law of thermodynamics 
is applicable to the entire universe, then it naturally follows that 
the universe is approaching a “thermal death”. It implies that sooner 
or later “the most probable” state of thermal equilibrium will 
be established in the universe, whereupon further processes will 
cease. This notion, first advanced by Clausius, is at variance with 
the materialist view that the universe is eternal in time. 
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CHAPTER XIII 
PROCESSES OF TRANSITION TO EQUILIBRIUM 


79. Diffusion 


A body interacting with its medium changes its state so as to come 
into equilibrium with the bodies surrounding it: its internal energy 
tends to a minimum, while its entropy increases and becomes 
a maximum when equilibrium is established. These two tendencies 
are usually conflicting. As a result it is difficult to predict the 
effect when both energy and entropy are capable of changing. Let 
us now consider the phenomena of diffusion, thermal conductivity 
and internal friction occurring in closed systems. In other words, 
we are concerned with equalisation of the concentrations, tempera- 
tures and velocities of some parts of a body with respect to others. 
(Naturally, equalisation of velocities is meaningful only for liquids 
and gases.) Since the energy cannot change in such systems, the tran- 
sition to a state of equilibrium consists only in-an increase in entropy. 

The basic laws for the phenomena of diffusion, thermal conductiv- 
ity and internal friction are very similar. Let us first consider 
diffusion processes. It is immaterial whether we deal with the equal- 
isation of the concentration of a gas or a liquid. Our discussion will 
even be valid for solid solutions (see p. 618), since in this case too 
the tendency to maximum entropy makes the atoms or molecules of 
a substance intermix so that a single concentration is established 
in all parts of the body. 

Let us consider two physically close, infinitely small volumes of 
a substance whose concentrations of diffusing atoms (or molecules) 
are c and c + dc. If these two volumes are a distance dz apart, the 


. d ‘ Z peas ur 
ratio oa gives the rate of change of concentration. This ratio is called 
the concentration gradient. If the x-axis is chosen so that its positive 
direction coincides with the diffusion direction, then a will be 


a negative quantity. Substance will migrate in the direction of lower 
concentrations. _ 

This does not mean that all the molecules move in one direction 
in a continuous, uninterrupted stream. On the contrary, diffusion 
maintains to a considerable extent the haphazard features peculiar 
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to molecular motion. The molecules move haphazardly in all direc- 
tions, including the direction of greater concentration, but the 
probability of molecules being displaced in the “correct” direction 
is greater than for other directions. This means that through an area 
perpendicular to the flow more particles pass in the direction from 
greater concentration to less concentration than the other way round. 
The basic law of diffusion states that the flow of matter yp, ‘i-e., 
the mass of matter passing through a unit area per unit time, is 
directly proportional to the negative 
gradient of the concentration: 


do 
dx * 


= ae 
tana= 35 <0 


p= — 


D, the constant of proportionality, 
is called the coefficient of diffusion. 
The above relation is convenient 
Since the coefficient of diffusion is, 
within broad limits, a constant for 
a given substance and medium. 

In measuring the concentration 
and the flow of matter, the units 
should correspond. Thus, ifthe con- 
centration is measured in grams per 
cm’, the flow should be measured in 
grams per cm? per sec. We see then 
that the dimensions of the coefficient of diffusion are completely 
determined and in the CGS system are expressed in cm?/sec. 

A decrease in concentration usually follows a sagging curve as 
shown in Fig. 88. If we are interested in the portion for which the 
decrease in concentration may be represented by a straight line, then 


Fig. 88 


Co—C 
p= —Dp 21 , 
Ta— Ty 


where c, and cz are the values of the concentrations at points 2, 
and 22, respectively. 


The coefficients of diffusion vary within broad limits. 
For example: 
4) for gases at temperatures from 0° to 15°C: 


hydrogen > oxygen, D=0 
air > oxygen, D=0 
D=0 


778 cm2/sec; 
: 17 
air + carbon disulphide, 


78 cm2/sec; 
099 cm2/sec; 
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2) for solutions of blue vitriol diffusing in distilled water (where c is in 
gram equivalents per litre): 


c D (cm2/day) 
0.1 0.39 
0.5 0.29 
0.95 0.23 


80. Thermal Conductivity and Viscosity 


The temperature equalisation process is very similar to that of 
diffusion. If the temperature of a body differs at different points, the 
entropy is not a maximum. In order for equilibrium to be established, 
the average velocities of the molecules, and hence the temperatures, 
must become equalised. 

If the temperatures at two neighbouring points separated by a dis- 


tance dz are T and 7 + d7’, the ratio a expresses the rate of temper- 


ature drop and is called the temperature gradient. 

During the process of temperature equalisation, portions of the 
body having more energy give up energy to portions of the body 
having less energy. In a certain sense, heat “flows” from one portion 
to another. The amount of heat passing from one portion of the 
body to another through a unit area per unit time is called the 
heat flow q. Just as in the case of diffusion, one can assume that the 
heat flow is proportional to the negative temperature gradient. 
The greater the temperature difference, the more rapid the heat flow. 
The formula 


is also convenient here too since the constant of proportionality x, 
which is called the coefficient of thermal conductivity, is a constant 
for a given substance and does not depend on the magnitude of the 
heat flow. For a linear temperature drop, the formula assumes the 
simplified form 

T.—T 

tg—2y ° 


q=— 


It is not difficult to determine the dimensions of the coefficient 
of thermal conductivity. In the CGS system, this coefficient is meas- 
ured in cal/cm sec deg. It is evident from the formula that x is the 
heat flow per second through an area of 1 cm? when the temperature 
drops 1° over a distance of 1 cm. 


The values of the coefficient of thermal conductivity, just as in the case 
of the coefficient of diffusion, vary within broad limits. 
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For example: 

1) for solid bodies (0°-18°C): cork—0.00012, wood—0.0008, fused 
quartz—U.0033 and silver—1.06 cal/em sec deg 

2) for liquids: carbon bisulphide (14°C)—2 X 1074, sulphuric acid 30% 
(32°C)—62.4 X L074, mercury (0°C)—0.2 cal/em sec deg 

3) for gases (0°C): carbon dioxide—3.4 xX 1075, air—5.7 x 107-5, hydro- 
gen—40.G X 1075 cal/em sec deg. 


The third phenomenon of this class relates to the equalisation of 
velocities. If a gas or liquid moves in some direction or other so 
that different layers of the substance move at different velocities, 
the motion is unstable. Sooner or later, the velocities must become 
equalised—the slower layers are accelerated and the faster ones are 
retarded. This phenomenon is also called internal friction or viscosity. 

Let us consider a liquid or gas moving along an z-axis. Assume 
that different layers of the fluid are moving at different velocities. 
Along the y-axis, perpendicular to the direction of flow of the liquid 
or gas, take two close points separated by the distance dy. The veloc- 


‘ : - adv, 
ities of flow differ at these two points by dv. Thus, the ratio on is 


the gradient of the velocity and expresses the rate of change of the 
velocity as we move away from the surface of the fluid. To make 
this clear, consider a rapid stream. The velocity of flow is amaximum 
at the surface and gradually decreases as the bottom of the stream is 
approached. 

If at some instant we eliminate the causes of the fluid motion, the 
velocities of the different layers will begin to be equalised in accord- 
ance with the law of increasing entropy. In order for such an equali- 
salion to be possible, an internal frictional force must exist between 
the layers of the liquid or gas. The magnitude of this force per unit 
layer area is proportional to the gradient of the velocity, i. e., 


dv 
are ere 


Here, n is the coefficient of viscosity (or internal friction). Its dimen- 
sions in the CGS system are gm/cm sec. Such a unit is called a poise. 


The viscosity of different bodies varies within even broader limits than 
the two analogous coefficients considered above. For example: ; 

1) Solid bodies: glass (710°C)—4.5 X 1010, glass (420°C)—4 x 101, lead 
(9°C)—4.7 X 104, ice (—14°C)—8.5 X 10!4  poises. 

2) Liquids: ethyl ether (25°C)—0.0022, water (20°C)—0.01, glycerine 
(0.8% water, 18°C)—13.93 poises. 

3) Gases: hydrogen (0°C)—8.49 X 1075, air (0°C)—17.19 x 10-5 poise. 

It is interesting to note that hydrogen has one-half the viscosity of air and 
seven times its thermal conductivity. That is why hydrogen is used to cool 
powerful turbogenerators. 
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81. Rate of Equalisation 


We all know that the time it takes for equilibrium to be estab- 
lished varies within broad limits. The temperature of a red-hot piece 
of iron thrown into water and the temperature of the water become 
equalised rapidly. On the other hand, the temperatures of air and 
a hot brick become equalised slowly. Nitrogen diffuses almost instan- 
taneously in oxygen, while the equalisation of the concentrations 
of a solution of blue vitriol takes many days. Similarly, the equali- 
sation of velocities also varies within broad limits, depending 
on whether we are dealing with a gas or a viscous liquid. 

A universal formula for the time of equalisation cannot be given, 
for the geometry of the object affects the time. A cooling body may 
have the form of a cylinder or a plate; a diffusing gas may initially 
be within a small spherical volume or distributed over some surface; 
and the internal friction may occur in pipes of various cross-section 
or in open reservoirs. The particular circumstances must be taken 
into account in each case and exact calculation of the value for the 
time of equalisation is a difficult mathematical problem. However, 
we can abstract from the geometric details and try to solve the 
problem in general form if we abandon the aim of obtaining an exact 
formula and are content to merely determine the proportionality 
between the physical quantities. In this connection, it is helpful to 
consider the dimensions of the physical quantities that should he 
related to each other. 

Let us consider, for example, the phenomenon of diffusion. It is 
evident that ¢, the time of concentration equalisation, depends, in 
the first place, on the dimensions of the region in which the diffusion 
occurs (characteristic length Z) and the properties of the diffusing 
substance (characterised by the coefficient of diffusion D). The 


. : : l A : 
diffusion equation has the form p = —D = . The dimensional equa- 


tion is then 


M M 
pr =lPl zr - 


2 2 
We see that T =F , i. e., the time of equalisation ¢ = K= and 


does not depend on the concentration. 

Therefore, the following conclusion may be drawn: Every rigorous 
solution determining the time of concentration equalisation for 
diffusion processes will always yield the equation 

L2 


De 
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where A is a constant, dimensionless quantity depending on the 
geometric conditions of the problem. The quantity Z, on whose 
square the speed of concentration equalisation depends, refers to the 
geometric dimension of the region in which the equalisation occurs. 
Thus, if the concentration becomes equalised within the limits of 
One centimetre in, say, 10 sec, then, within the limits of two cen- 
timetres, it will become equalised in 40 sec. 

We can solve the problem of temperature equalisation in exactly 
the same manner. The basic relation of this phenomenon includes 
the following quantities: heat, coefficient of thermal conductivity, 
temperature and distance. However, the increment of heat per unit 
volume may be written in the form 


dq=pc,daT, 


where c,» is the specific heat at constant pressure and op is the density 
(thus, cpp is the thermal capacity of a unit volume). Therefore, the 
following quantities must be related to each other: temperature, 
length, time, density, thermal capacity and thermal conductivity. 
It is easily verified that the time ¢ cannot depend on the tempera- 
ture and can only be expressed by the ‘other quantities as follows: 


L2pcp 
x 


Thus, the time of temperature equalisation is expressed by the for- 
mula 
ta 


? 


st 
x |S 


wliere 7 designates the combination of constants pat The quantity 


x is called the thermometric conductivity and has been introduced in 
order to put the formulas for the equalisation of concentration and 
temperature in similar form. The coefficients of diffusion and ther- 
mometric conductivity have the same dimensions and are complete- 
ly analogous in the two equalisation phenomena considered. 

We thus see how the cooling of a body is determined. The greater 
the density and thermal capacity, and the smaller the coefficient of 
thermal conductivity, the slower the process. 


Example. Let us compare two rods of identical dimensions made of fused 
quartz and silver, respectively. For quartz, x = 0.0033 cal/em sec deg, p = 
= 2.65 gm/cm, and cp = 0.1844 cal/gm deg, i.e., ¥ = 0.676 X 10-2 cm?/sec. 
For silver, x = 1.06 cal/em sec deg, p = 10.5 gm/cm? and cp = 0.0558 cal/gm 
deg, i.e., y = 1.71 cm2/sec. Thus, the equalisation of temperature in the quartz 
rod takes 253 times longer than in the silver rod. 


Just as for diffusion, equalisation of the temperatures is character- 
ised by a dependence on the square of distance, i.e., the time of 
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equalisation is proportional to the square of the linear dimension 
of the region. 

Without going through an analogous procedure, let us write the 
formula for the time of velocity equalisation for the various parts of 
a liquid or gas. It is not surprising that the form of this relation is 
similar, namely: 


2 
pes 
v 


The coefficient v, which determines the rate of equalisation of the 


velocities, is equal to a and is called the kinematic viscosity. 


Example. For water, 1 = 0.0! poise and p= 1 gm/cm’, i.e., v= 
= 0.01 cm?/sec; for glycerine, n = 13.9 poises and p= 1.25 gm/cm%, i.e., 
v = 11.1 cm?/sec. This means that if a disturbance in glycerine is equalised 
in 0.1 sec, the same disturbance in water will be equalised in about 2 minutes. 


82. Steady Processes 


If a body is left undisturbed, the differences in temperature con- 
centration and velocity of the various parts of the body will be equal- 
ised, to be sure, in accordance with the principle of increasing entro- 
py. However, it is also possible to have a state of a body for which, 
over a prolonged period, the flow of heat or matter, or the velocity 
distribution of various parts of the body with respect to each other, 
remains unchanged. Processes of this type are called steady processes. 
Naturally, in the case of a steady process, the body is not in a state 
of equilibrium, 

Under what conditions are such processes possible? Let us consider 
a metallic rod to which at each instant of time, a certain quantity 
of heat is supplied at one end of the rod, while the other end is in 
thermal contact with a cold body. The condition under which the 
temperatures along the rod will not change, i.e., the condition of 
constant temperature gradient along the entire path of heat flow, is 
that the quantity of heat absorbed by the cold body be exactly equal 
to the quantity of heat supplied by the hot body during the same 
period of time. ‘ 

Under analogous conditions, a steady diffusion process is also 
possible. To create such a process, a certain quantity of matter must 
be supplied to one part of a body and the same quantity must be 
removed from another part. In this manner, a constant difference of 
concentration is maintained between two parts of the body. 

A steady viscosity process may be obtained, for example, in the 
region between two coaxial cylinders rotating at different velocities. 
Since close to the solid surface the liquid or gas has the same veloc- 
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ity as the solid wall, a constant velocity gradient is created within 
the fluid. 

Steady processes do not arise immediately. A certain amount of 
time must elapse for such processes to become established. 

Let us assume that one end of a rod that transmits heat is placed 
in snow. At the initial instant, the temperature of the rod is equal 
to zero at all points. If the other end of the rod is then brought into 
thermal contact with boiling water, the temperature begins to rise 
throughout the rod, but, of course, not at the same rate at all points. 
Almost immediately, a high temperature is established at the end 
of the rod that is in contact with the boiling water. The temperature 
of the end of the rod that has been placed in snow will rise slowest. 
After a certain period of time, the temperature will cease rising at 
all points of the rod and a definite temperature distribution becomes 
established, i.e., the process becomes steady. The nature of the tem- 
perature distribution depends on the amount of heat supplied 
(or removed) per unit time. 

In an electric iron heated by a spiral element, the highest temper- 
ature is in the central region and the temperature gradua ly drops 
towards the outer edges. Naturally, the air immediately surrounding 
the iron is hottest. With increasing distance from the iron, the tem- 
perature falls more rapidly owing to the low thermal conductivity 
of air. 

In the case of small bodies in air or liquid, the temperature curve 
need not be considered in rough calculations, i.e., it suffices to deal 
with the temperature difference 7 — TJ between the body and the 
medium. The heat flow per unit time from the body to the medium 
may then be assumed to be proportional to this temperature differ- 
ence: 


q=k(f—T)). 
The coefficient & is called the coefficient of thermal output and is an 
important engineering quantity. In courses on heat engineering, 
values for this coefficient are determined and related calculations 
discussed. ; 

Let us designate by P the power supplied to a body, e.g., electric 

power in the case of an electric iron. The condition for a steady proc- 
ess requires that 

P=k(T—T,). 
Here, 7 is the body temperature established in this steady process: 
T=T)+ - . It may vary considerably, depending on the power 


supplied and the conditions of heat exchange. : 
It is appropriate at this point to comment on the temperature 
indicated by a thermometer placed “in the sun”. The thermometer 
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is involved in the steady process of transferring solar heat to the 
surrounding air. Depending on the value of the coefficient of thermal 
output, the thermometer lying in the direct rays of the sun may 
indicate any value whatsoever. The temperature measured under 
such conditions is the temperature of the thermometer and is in no 
way an indicator of the weather. 

We shall not consider the analogous diffusion problems. 


83. Motion in a Viscous Medium 


Consideration of the dimensions of physical quantities helps to 
solve problems of tremendous practical importance. One such prob- 
lem is the steady flow ofa liquid or gas around an obstacle or, what 
amounts to the same thing, the steady motion of a body in a medium. 

The most important problem concerns the resistance force ex- 
perienced by a body in moving through a medium. This resistance 
force may depend on the body dimension L, the body velocity u, 
and properties of the liquid (or gas), namely, its density p and vis- - 
cosity y. Other quantities should play no part in this process. 

Let us first consider the dimensionless quantity comprised of L, u, 


p and 1. It will be recalled that the kinematic viscosity, v =— , has 


the dimension Z?7~-?. But the product Lu also has this dimension. 
Therefore, 


Ree plu lu 


n Vv 

is a dimensionless quantity. This quantity is designated as indicated 
and is called the Reynolds number. It can be shown that Re is, in 
effect, the only dimensionless combination of the indicated quanti- 
ties. Other dimensidnless quantities can only be functions of the 
Reynolds number, i.e., f (Re). If the motions of different bodies in 
different fluids lead to one and the same value for Re, the motions 
are said to be similar. A large technical field founded on the prin- 
ciple of similitude has developed. In this field, the characteristics 
of a phenomenon are determined on the basis of observations of a 
similar phenomenon occurring in a model. 

Let us now return to the problem raised above, namely, finding 
the expression for the resistance force experienced by a body moving 
in a medium. 

The dimensions of force are given by M/LT7~. This can be equated 
to the dimensions of the quantities with which we are dealing, since 
it cannot depend on any other quantities. Thus, 


MLT~ = [p]* [u)* [L]" (n)°, 
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i.e., 
MLT~? = M*L~**L’r PLY MeL rT. 
Hence, 
até=1, —3e+fh+y—6=1, —p—6=—2. 
expressing a, B and y in terms of 6, we obtain 
a=-1—6, p=2—6, y=2—6. 


Thus, in the most general case, F may be expressed in the fornmi of a 
sum of terms, each of which has the indicated dimensions, i.e., 


F = A[p! ou? on? | = pu2L2A [ (<2) ~*] ’ 


where A represents numerical coefficients. We have thereby demon- 
strated that the force must be given by the formula 


F = Kou*L*f (Re). 


This result has been obtained only by considering the dimensions! 
The function f (Re) is not known and must be determined experi- 
mentally. 

Definitive formulas for limiting cases may be obtained by simple 
reasoning. If the velocity is small, F must be proportional to the 
first power of the velocity w. For this to be true, f (Re) must be equal 
1 


ta Re 


and, therefore, 
; F=Kvwrul. 
The numerical value of the constant depends on the form of the 
body. For a sphere, 
F=6x ur, 
where r is the radius of the sphere. The last formula is known as 
Stokes’ formula. 


Example. A mercury globule (r = 0.53 mm), sinking in glycerine with 
a velocity of 0.6 cm/sec, experiences a force of friction of about 8 dynes. 


In the case of very large velocities, the fluid motion with respect 
to a body ceases to be steady. Eddies appear and the motion becomes 
turbulent. The body motion may be steady, but the fluid particles 
move more or less randomly. Owing to the intense nature of the 
disturbance, the transfer of motion from layer to layer ceases to 
depend on the viscosity. This can only occur if f(Re) approaches a 
limit as the velocity increases. Therefore, for large velocities, the 
resistance force becomes proportional to the velocity squared: 


F=KpwL?, 
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84. Coefficients of Diffusion, Viscosity and 
Thermal Conductivity for Gases 


The processes of equilibrium establishment in gases are closcly 
related to the characteristics discussed in the previous article. Tem- 
perature, concentration and velocity equalisation of some parts of 
a gas with respect to others occur owing to the mixing of the mole- 
cules. The rate of this mixing is determined by the role played by col- 
lisions between particles. For example, in case of a large free path, 
the fast molecules quickly penetrate into the regions where the slow 
molecules are located and distribute themselves throughout the gas. 

It is quite natural that the time of equalisation in all three processes 
should be of the same order of magnitude as the time between 
molecular collisions. This may be verified by theoretical calculations 
for particular cases, but we shall not concern ourselves with this 
problem. 

Taking the equation for the equalisation time to be t = - , where- 
by a dimensionless constant of proportionality whose order of 
magnitude is usually equal to unity is omitted, we obtain on the 
basis of Sec. 81 perfectly identical expressions for the coefficients 
of diffusion,* kinematic viscosity and thermometric conductivity 
(assuming L ~l): D~v~ yw ol. 

The following table indicates the accuracy obtained: 


air hydrogen 
v=0.13 v=0.94 
x¥=0.18 ¥=1.3 
vl=0.27 vl=1.9 


These results should be considered good. Agreement within an 
order of magnitude cannot be viewed as accidental when it is recalled 
how greatly the quantities with which we are dealing vary. 

Using the expression for the coefficient of thermal conductivity 
in terms of the thermometric conductivity, we obtain 

MC pv 
%~ pulp ~——» 


where m is the mass of a molecule. 


* It should be kept in mind that, in addition to the concept of diffusion 
of one substance in another, there is the concept of self-diffusion, i.e., the 
motion of molecules among similar molecules, e.g., the diffusion of hydrogen 
in hydrogen, oxygen in oxygen, etc. Investigation of this phenomenon became 
possible after the technique of radioactive tracers (atoms and, hence, mole- 
cules) was introduced. 

Thus, D is here the coefficient of self-diffusion. 
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In this formula, 2, the number of molecules in a unit volume, has 
cancelled out. It follows, therefore, that the thermal conductivity of 
a gas does not depend on its density and, hence, pressure. We should 
carefully note this unexpected, but nevertheless perfectly correct, 
conclusion. Increasing the density of gas does not lead to an increase 
in thermal conductivity. 

Another prediction may be made on the basis of the formula for 
the coefficient of thermal conductivity. Since the effective cross- 
section and the thermal capacity hardly depend on the temperature 
(generally speaking, o decreases slightly with increasing tempera- 
ture), and the thermal velocity is proportional to ]/ 7, the coefficient. 
of thermal conductivity should be proportional to the square root 
of the temperature. 

The data presented below indicate the accuracy of these two pre- 
dictions. For example, for nitrogen at O°C, 325°C and 500°C: 


Mg i Xi %,=1.93:1.65:1 
VT3:V T.: VT, = 1.68: 1.48: 1. 


We see that the thermal conductivity increases with temperature 


somewhat more than proportionally to | 7. This is due to changes 
in the cross-section and thermal capacity. As can be séen, the thermal 
conductivity is independent of pressure in a very broad interval. 

Similarly, the dynamic viscosity 1 ~ orl also does not depend on 
the pressure and density of the gas. The temperature dependence of 
the viscosity of an ideal gas should be the same as that of the thermal 
conductivity, i.e., the same proportionality should exist. A numeri- 
cal example will help to fix this point. 

For nitrogen (7, = 273°, T,== 289°, T; = 296°): 


N32 Noi y= 1.06:1.04:4 
VT3:V To: VT, = 1.04:1.03: 4. 


The viscosity of a gas rémains amazingly constant with changing 
pressure. When the pressure of CO, changes by a factor of 380—from 
2 mm to 760 mm of mercury—the viscosity practically does not 
change. It remainis at all times equal to 14.8 x 1075 poises—to within 
an accuracy of one unit in the third figure. 
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When the free path length in a gas is greater than the linear 
dimensions of the vessel, we say the gasisultra-rarefied. Under normal 
conditions, the magnitude of the free path length is of the order of 
40-5 cm and is inversely proportional to the density. Therefore, at 
a pressure of the order of 10™* mm of mercury, the free path length 
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is measured in tens of centimetres. For vessel dimensions of about 
10 cm, a vacuum or ultra-rarefied gas is obtained at such a pressure. 

It should be noted that even in a vacuum the number of molecules 
per unit volume is large. For the pressure indicated above, 1 cm?® 
of gas contains tens of thousands of millions of molecules. 

When molecules cease to collide with one another, and collide 
only with the walls of the vessel, the gas acquires certain special char- 
acteristics. A number of concepts become meaningless under such 
conditions. Thus, it is no longer possible to speak of the internal fric- 
tion of the gas molecules, since there can be no molecular layers 
exchanging velocities in the gas. It isno longer possible to speak of the 
pressure of one part of the gas with respect to another (however, the 
concept of gas pressure against the walls of the vessel retains its mean- 
ing). Also, the concept of heat exchange between different parts of 
the gas and, in general, all concepts related to interaction between 
different parts of the gas become meaningless. An ultra-rarefied gas 
interacts only with bodies within the gas. 

It will be useful to illustrate by means of examples the specific 
character of vacuum as a special physical state of a gas. we 

What is the expression for the heat flow from one plate to another 
when these plates have different temperatures 7, and 7’. and are 
located in a vacuum? The essence of heat exchange in this case consists 
in the following: gas molecules strike a wall and rebound from it with 
an average velocity corresponding to the temperature of this wall. 
As regards the expression for the heat flow, examining the familiar 
formula 
=pcvl Arh ‘ 
we see that the change consists in the fact that the role of free path 
length is now played by L, the distance between the walls. Therefore, 
the expression for heat flow should assume the following form in 
the case of ultra-rarefied gases: 


gan tp 


g=pcv (T,;—T-). 


According to this formula, when ultra-rarefied gases are rarefied 
still further, the heat flow should decrease after the free path length 
becomes comparable to the linear dimensions of the vessel. And this 
is precisely what experiment shows to be the case. 

Also peculiar to an ultra-rarefied gas are the equilibrium condi- 
tions for a gas in two communicating vessels at different tempera- 
tures. In the case of a usual gas, the gas pressures in both vessels are 
the same at different temperatures, but the gas densities are differ- 
ent, i.e., they are inversely proportional to the temperatures. 
Equality of pressures is necessary for equilibrium, for otherwise gas 
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molecules will be knocked from one vessel into the other as a result 
of molecular collisions. 

In the case of a vacuum, the situation is completely diflerent. In 
this case, no collisions occur between molecules and as a result the 
molecular flow between vessels is not impeded. The equilibrium 
condition consists in equality of molecular fluxes. If there are n 
particles in a unit volume and the particles move with a velocity v, 
then nv molecules pass through a unit area per unit time. Thus, for 
equilibrium, ny, = nogve. Since the number of molecules in a unit 
volume is proportional to the pressure divided by the temperature 
(this follows from the equation of state for an ideal gas) and since 
the molecular velocity is proportional to the square root of the tem- 
perature, the condition for equilibrium assumes the form 


Py Pe : 

VT, VTao 
Thus, it is not the pressures that are equal, but rather the ratios | 
the pressures to the square root of the temperatures. If the gas de 
sity is increased, the pressures gradually begin to be equalised < 
the usual equilibrium condition is obtained when the free 1 
length becomes sufficiently small. 


PART TWO 
ELECTROMAGNETIC FIELDS 


CHAPTER XIV 
ELECTRIC FIELDS 


86. Vector Properties of Electric Fields: Intensity and Displacement 


The presence of an electric field in a region may be recognised by 
a variety of properties. Thus, an electric field creates a force that acts 
on electric charges. Also, it can induce electric charges on the surface 
of a neutral metallic body. 

By measuring the force acting on a charge g, one can show that 
the force F has different magnitudes and directions at different points 
in space, and at a given point is proportional to g. Hence, it is pos- 
sible to describe an electric field by its intensity EZ, which is defined 
as follows: 


The stipulation should be made that q must be small. Then, H may 
be measured at points in space that are sufficiently close to each 
other and the field created by charge q does not noticeably distort 
the measured field. 

A vector field is frequently characterised by so-called vector flow 
lines. The tangent at each point of such a line coincides with the di- 
rection of the vector at this point. This also holds for electric fields, 
which may be characterised by vector flow lines of electric intensity FE. 


Numerical examples. 1. The electric field intensity of an incandescent wire 
is tens of volts per centimetre. : 
2. The electric field intensity of the Earth close to its surface is ~ 
1 statvolts : : 
~100 volts/metre = =. ————-._ (Gaussian units). 
300 cm 
3. The electric field intensity of a hydrogen atom’s nucleus at a distance 
corresponding to the radius of the electron’s ‘‘orbit’’ is 19.2 10° statvolts/em = 
= 57.6 X 1010 volts/metre. 
4. The elcctric field intensity at which air breakdown occurs is 30 kv/em = 
= 100 statvolts/cm. 


An experiment for the determination of the charge induced by a 
field may be conducted using two small metallic plates fastened to 
an insulated handle as shown in Fig. 89. Such plates are called Mie 
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plates—after the German physicist G. Mie. Placing the pair of closely 
spaced plates in a field, and then carefully rotating them, positive 
charge may be accumulated on one plate and negative charge on the 
other. Moreover, the quantity of induced electricity may be measured 
by an electrometer or ballistic galvanometer. 

Experiments show that the plates may always be so oriented_that 
no electricity is induced on the plate faces. For homogeneous, iso- 
tropic bodies (and for the present we shall not consider others), this 
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Fig. 89 


occurs when the plate faces are parallel to the vector #. On the other 
hand, the induced electricity is a maximum when the plate faces 
are perpendicular to the vector HZ. This enables us to introduce still 
another vector to describe an electric field, namely, the electric dis- 
placement 2, which is defined by the following condition: The vector 
® is normal to the Mie plates when the orientation of the plates is 
optimal with respect to induced charge, i.e., when a maximum 
charge is induced on them. Moreover, this vector is directed outward 
from the positive Mie plate. In all cases, except for anisotropic bodies, 
® and EF have the same direction. The absolute value of 9 is equal too: 


|2|=6, 
where o is the surface charge density of the Mie plate. Since the sur- 


2 ; d 
face charge density o may be written as a then 


A 

dq 
-|\9|/=-~=. 
2 |= 
It was stated above that the electric field may be characterised 
by the flow lines of vector H#. We can, of course, also describe the 
field by the flow lines of vector 9, i.e., the electric lines of force. 
The number of lines of force passing through a unit area perpendicular 
to the force lines is |9|=, and the quantity ; 
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is called the electric flux through the area dS. If the same electric 
flux passes through the inclined area dS as through dS,, then 


aS | 
cos @ 


dS = ; 
where a is the angle between the normal to the area and the lines of 
force, i.e., . 


dN=®DcosadsS. 


The flux through a large surface is expessed in the form 
N= \ Dcosads, 


and the flux through a closed surface is usually denoted by placing 
a circle on the integral sign: 


N= Deosads. 


87. Permittivity 


Experiments show that the two vectors characterising an electric 
field are related. For the case when these vectors are parallel, they 
are also proportional to each other at a given point in space.* 
A change in the vector # results in a proportional change in the vector 
®. The ratio 2 depends only on the medium. 

It is customary to characterise the electric properties of a medium 
by the dimensionless quantity ¢, whose value is selected so that ¢ = 1 
for vacuum. The reason for this condition is the fact that, as will 
presently be seen, no body can exist with e< 1. Therefore, it is 
natural to “base” the value of € on vacuum. The quantity ¢ is. 
called the permittivity and defined by the equation 


2 = xe, 
where &) depends on the choice of units. If the state of the medium 
does not vary from point to point, then e is also constant. At the 
boundary between two media, e¢ changes abruptly. Bodies that are 
nonhomogeneous with respect to density and other properties are 
usually nonhomogeneous with respect to permittivity as well. 
The permittivities of several substances at 18°C are as follows: 
air—1.00059, glass—7.00, paper—2-2.5 and water—80.5. 


* The case of anisotropic media, where the vectors D and # are not paral- 
lel, will be considered on p. 257. 
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In engineering, the quantity ® is measured in coulombs/metre * 
and the field intensity in newtons/coul. Then, €9 is measured in 
coul*/metre® newton and, in these units, 

10-9 coul? 


&0="36n metre? newton * 


In the Gaussian system of units used in physics, €9 is dimensionless 


and equal to < 


5) ee 


4h 


A quantity called the electric induction D may be used instead of 
displacement. It is 4x times larger than displacement and in the Gaus- 
Sian system D = eL. 

As we shall soon see, both choices for the value of &9 have their 
relative advantages. The first system simplifies one group of formulas 
but complicates another, while the second system leads to the reverse 
result. 

It should be emphasised that the concepts of electric displacement 
and electric induction have exactly the same qualitative meaning. 
The difference in the numerical factor merely leads to a difference 
between the ratio of an electric induction unit to a charge density 
unit and the ratio of a displacement unit to a charge density unit. 

The electric displacement is equal to unity if the charge density 
on the Mie plates is equal to unity (see p. 227), while the electric 
induction is equal to unity if the charge density on the Mie plates is 
equal to ~: 

In electrical engineering, as a rule, only the quantity ®, i.e., 
displacement, is used. On the other hand, in physics, the electric 
induction D is used exclusively. 


Several comments are necessary regarding the formulas and units of meas- 
urement that are used in this part of the book. 

Although in mechanics and thermodynamics various choices are made for 
the fundamental quantities and various units of measurement are used, never- 
theless, the constants of proportionality are invariably dimensionless. There- 
fore, the form of the formulas in those fields of physics does not depend on the 
choice of the system of units. 

Unfortunately, however, the situation is not the same in the case of elec- 
tromagnetic ficlds. Two general approaches exist, i.e., one approach has been 
adopted in electrical engineering and another in physics. Not only is there 
a difference in the choice of the fundamental quantities and the units of meas- 
urement, but it turns out that we must distinguish between the constants of 
proportionality for one and the same formulas. Of necessity, one must become 
familiar with the formulas in both systems. This will be done in the course of 
the presentation, but for the present it suffices to limit ourselves to several 
general comments. 
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In electrical engineering, the so-called MKSA system is widely used. Side 
by side with the metre, kilogram and second, a unit of current is taken as 
fundamental. A current of 1 amp is defined as the current for which the constant 
of proportionality to, i.e., the magnetic permeability of vacuum, which occurs 
in formulas for electrodynamic interaction (sec p. 273), has the value 


[lo =42 X 10-7 joule/amp? metre. 


Experiments show that for such a choice of unit current 1.118 mg of sil- 
ver is deposited per second on an electrode when a current of 1 amp passes 
through a silver nitrate solution. The historical reasons for this choice, which 
may appear strange, will not be explained here. 

All units in the MKSA system may be expressed in terms of the kilogram, 
metre, second and ampere. 

Since in electrical engineering the system of units is based on four funda- 
mental quantities, there is no way of obtaining the same set of formulas in the 
CGS system, which is based on three fundamental quantities. There are, how- 
ever, other differences between these two systems. These are expressed in the 
different choice of numerical, dimensionless coefficients. In the course of pres- 
entation, we shall on occasion list certain formulas in both systems, while 
in the appendix the reader will find a compilation of the electrodynamic for- 
mulas in both systems with the units of measurement indicated. 


88. Electric Field Relations 


Consider a system of electrically charged bodies forming an arbi- 
trary field. Now, describe a closed surface in this field. Some of the 
charges will fall within the surface, while others will be external 
to it. The result obtained by measuring the electric flux in the out- 
ward direction through this surface is very simple and in no way sur- 
prising. Thus, the total electric charge induced on the surface— 


which by definition is precisely the flux V = b D cos a dS—is equal 
to the total electric charge within the volume enclosed by this surface: 


§ D cosa. dS = >, qi- 


This theorem, named after Gauss and Ostrogradsky, shows that lines 
of electric flux begin on charges of one polarity and terminate on 
charges of the other polarity. Interrupted lines of force do not exist. 

Lines of electric intensity closed on themselves do not exist in 
a constant electric field.* This follows from the second law for elec- 
tric fields, which states that an electric field (more accurately: the 
vector field of electric intensity E) is a potential field. Thus, the 
work performed in moving a charge along a closed curve is equal to 
zero in such a field, i.e., closed lines of vector H do not exist. The 
work performed in moving a charge from one point to another 


* In vacuum and homogeneous media, the # and ® vector lines coincide. 
In this case, we can speak of the electric lines of force without indicating which 
of the vectors is being considered. 
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depends only on the location of these points and does not change when 
the form of the path changes. In this respect, the properties of an 
electric field are the same as those of a gravitational field. 

Let us select a reference (initial) point in an electric field and cal- 
culate potential energy with respect to this point. No matter what 
path is taken, the work A in moving a charge from the initial point 
to a given point in the field is always the same. Therefore, at this 
point the charge possesses a potential energy U that is numerically 
equal to the expended work A. 

Just as the potential energy in a gravitational field is proportional 
to the mass of a body, the potential energy in an electric field is 
proportional to the charge: ; 


U=Qq. 
The quantity 9 = ce , i.e., the potential energy that a unit positive 


charge would possess at a given point in the field, is called the electric 
potential of the field or, simply, the potential. 

The expression for the work performed in moving a charge from 
one point of the field to another follows from the definition of poten- 
tial. Since work is equal to the change in energy, i.e., dA = —dU, 
then 

dA=Fdl=qE dl=—qdq, 


or Hdl= —dQg, 


where d@ is the change in potential. 
For a finite portion of the path 


2 
\ Edl=o1— 92. 
1 


Thus, the potential difference* is equal to the workexpended in 
moving a unit charge. 
If a charge moves along a line of force, vectors need not be used. 
Then, 
2 


| Edl=o:—92. 
1 


* In a variable field, the above equation is not valid. To avoid confusion, 
2 


it is convenient to introduce a separate term for \ Edl. We refer to it as the 


electromotive force (emf) along the path between points 1 and 2. For constant 
fields, the emf and the potential difference are equal. 
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Finally, in a uniform field, the formula is simplified to 


_— Pi-P2 
BS d : 


where d is the distance between points 1 and 2. 
Formulas relating E and @ are written without constants of pro- 
portionality and have the same form in all systems of units. 


Examples. 1. Assume that two flat electrodes of area S = 10 cm? are located 
in air at a distance of 5 mm apart and that the potential difference be- 


tween them is 5,000 volts. The intensity of the created electric field is E = 
= 10° volts/metre= 23 statvolts/cm (Gaussian units) and the electric displacement 
in the field of this condenser is D = e94E=9%X 10-* coulomb/metre?. This means 
that the charge density on the condenser plates is o = 9 X 107° cou- 
lomb/metre? = 2.7 statcoulombs/cm? (Gaussian units). The electric flux through 
the face of the electrode is N = DS = 9 K 10-9 coulomb and the charge on 
one plate is q = oS = 9 X 10-9 coulomb. Thus, N = q, which agrees with 
the Gauss-Ostrogradsky theorem. 

2. The electric displacement of the Earth's field close to its surface is 2 ~ 
~ 9X 10710 coulomb/metre®. Since the area of the Earth’s surface is S ~ 
~ 5 X 104 metres? and the surface charge density is o ~ 9 X 1079 cou- 
lomb/metre?, the Earth's charge is q ~ 4.5 X 105 coulombs. Thus, the electric 
flux passing through the Earth’s surface is N ~ 4.5 x 105 coulombs. 


89. Field Calculations of Simple Systems 


Using the electric field relations presented in the previous article 
and from general considerations of symmetry, we can determine the 
field for certain simple systems. To determine the field means to 
calculate the electric intensity, induction or potential. It should be 
noted that knowledge of the potential suffices to characterise the 
field. If @ is known at all points in space, we can determine the value 
of vector # by differentiating @. This becomes particularly clear 
if we construct surfaces of equal potential (equipotential surfaces) 
satisfying the equation @ (z, y, z) = const. Since the work of moving 
a charge along an equipotential surface is equal lo zero, the lines of 
force are directed normally to the equipotential surfaces. Thus, to 
determine the value of | F |, one must differentiate —p (x, y, 2) in 
the direction of the normal. This type of mathematical operation 
is considered in vector analysis. However, such differentiation is 
easily performed graphically using a curve in which 9 is plotted as 
a function of the coordinates along a line of force. The tangent of 
this curve’s angle of inclination at any point is equal to the negative 
of & at that point. 

In order to enable the reader to better understand these new con- 
cepts, we shall use examples to analyse the peculiarities of potential 
and the vector characteristics of a field. We repeat, however, that in 
principle knowledge of the potential suffices to solve the problem. 
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Point Charge. From considerations of symmetry, one can see that the 
field of a single, point charge is a radial, spherically symmetrical field. 

Consider a sphere of radius r. The electric flux emanating from 
a charge q is equal to 


\ DecosadS = q. 
The angle a is the angle between the lines of force and the surface 
of the sphere, i.e., it is equal to 90°. At all points on the surface, 


> has the same value and may, therefore, be brought out in front 
of the integral sign. Then 


Dh dS =q 


and, since Gas = 4nr* (area of the sphere), the electric displacement 


at a point located at a distance r from the charge is D = 7a and 


the electric induction is D = +. 


The intensity of the electric field is 


q 


1, <== ——_—_., 
ANEger? 


In this case, the Gaussian system of units in which &) = Zz is more 


convenient. Then / = 4; and in the case of vacuum 


q 


~ Pe 
Since the field intensity is equal to the derivative of the negative 
potential along the line of force, i.e., 


y —— 


the expression obtained for the potential of a point charge is 
q 


r 


The constant of integration has been assumed to be equal to zero. 
This fixes the reference potential » = 0 at infinity. 

Thus, the potential of a point charge is inversely proportional to 
the first power of the distance, while the intensity is inversely pro- 
portional to the distance squared. 

If the charge is in a medium whose dielectric constant is e, the 


intensity and the potential are reduced to 4 of the values in vacuum~ 
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The Earth’s potential is equal to 0.07 volt if the potential at infinity is 
taken equal to zero. In electrical engineering, the potential of the Earth is 
assumed to be equal to zero. 


Systems of Point Charges. Let us consider methods of calculating 
fields created by systems of point charges. Assume e¢ = 1 and let us 
use the Gaussian system of units. Then, the potential for a system 
of charges may be written in the form 


ty Oe 08 th a 
ae aa rs Cree eae erg 


where r;,, is the distance from the charge gq, to the point of observation. 


In the case of two charges of equal magnitude, but opposite sign, 
we obtain 


When the signs are the same: 


{ 4 
Peete) 
The form of the above formulas may turn out to be inconvenient 
for solving a given problem. Thus, it is often expedient to introduce 
a Cartesian system of coordinates and express the radius r;, in terms 
of x, y, z. In the case of two charges, separated by a distance 2a, it 
is convenient to locate the origin of the coordinate system at the 


evel of the system, with the z-axis passing through both charges. 
1en, 


ri=(x—a)?4+ y?+2? and r2?=(z+a)?+y?+ 22. 


Sometimes it is expedient to represent the potential as a function of 
the polar coordinates R and g. From Fig. 90, one can see that 


ry =V R? 4a?—2aR cos @ and r= V R? +a? + 2aR cos 9. 


The field intensity of a system of point charges is given by the vec- 
tor equation 


Res es Th ™h 
SPE Be ee py pe ape Se hg 
i 4 2 72 r3 3 ROR 


Here, a is a unit vector in the direction of radius ?°,;. 
2, 
Vector addition is used to map the lines of force. 
Universal Formula for Potential. When a field is created by volume 
and surface charges instead of point charges, the potential of the 
field may be calculated if the charge distribution is known. 
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Consider the region of the volume charge to be divided into 
infinitely small volumes dv and the area of the surface charge to be 
divided into infinitely small elements dS. lf p = 1 is the volume 
density of the charge and o = ae is the surface density, the potential 


created by a volume dv is equal to pe and the potential created 


Fig. 90 


by a surface element dS is equal to ss . Adding the potentials creat- 
ed by all the elements, we obtain: 


g=f PP alse. 


r 


The radius r is drawn from the point of observation to all the points 
in space where charges pdv and odS are concentrated. 

This formula is rarely used since the charge distribution as a func- 
tion of p and o is not usually given. In fact, the charge distribution 
is generally being sought. 

Field of a Spherical Condenser. Consider a sphere of radius r, 
having a charge-+ gq and surrounded by a concentric, spherical sur- 
face of radius ry. It is convenient to view the external sphere as 
grounded, whereupon a charge —gq is induced on its inner surface. 
Considerations of symmetry indicate that the field is radial. If we 
describe a sphere of radius r between the condenser spheres and 
apply the Gauss-Ostrogradsky theorem, the result obtained does 
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not differ from that for a point charge, i.e., 


ee, 


r2 


The potential equation has the form 
p= a. + const, 


but the constant in this case should not be discarded as was done 
previously. As is well known, the potential of grounded metallic 
parts is usually taken equal to zero. It will be, therefore, more con- 
venient to set p = 0 at r=rzy rather than at infinity. We then 


obtain: const =—-“, 


'B 
The expression for the potential in the region between the spheres 
has the form 


sees. pee I 
¥ = r rB 
On the surface of the inner sphere, 
eae eee 
TA rR 


Recalling that the ratio of the charge to the potential difference 
between the conductors of a condenser gives the capacitance, we 
obtain for the capacitance of a spherical condenser 
ae 1.) oars 
C= 1 Le 7 ra—ls A 
TA TB 

If the radius of the outer sphere is increased (rz — oo), the expres- 

sion for the capacitance is reduced to 
C=f, 

Thus, the capacitance of a single sphere is equal to the magnitude of 
its radius. 

If the dielectric between the conductors of the condenser has 


eee : : : 4 
a permittivity e, the intensity E and the potential @ decrease to — 


€ 
of the above values. 
From the formula 


vot (S38), 


TA 'B 


TATB 
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and for a sphere, 
C=er. 


Thus, the capacitance is ¢ times the value obtained for vacuum. 
The potential and field formulas being used are applicable for 
points in the region between the conductors of a condenser. They 


e v 


Fig. 94 


are not applicable to points within the first conductor or external 
to both conductors, for Gauss’s theorem yields different results for 
these points. 

If the charge of the internal sphere is concentrated on its surface, 
then for points within the sphere 


§ Dcos a dS =0. 


Since an analogous relation is valid for every surface within the 
sphere, this requires that D = 0 and, hence, the field intensity is 
also equal to zero. Thus, Gauss’s theorem shows that there is no 
field within the sphere if all the charge is distributed on its surface. 
Since EF = 0, the potential @ remains constant and equal to the 
value of @ on the surface of the sphere. The above is illustrated in 
Fig. 91 by the curves of E and @ as functions of r. 


Examples. 1. The electric field intensity at the Earth’s surface is 


7,400 \2__ 
( £780 ) = 1.33 


times the intensity at a distance of 1,000 km from the surface. 
2. The Earth's capacitance is C = 6.4 X 108 cm (Gaussian units) = 700 pf. 
3. The capacitance of condensers used in radio engineering may be as small 
as : een of a picofarad (1 pf = 10712 fd) and as large as thousands of mi- 
crofarads. 


Field of a Uniformly Charged Sphere. It is evident that outside 
such a sphere the field is the same as for a point charge or a surface- 
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charged sphere, i.e., 


q 
E = 2 ’ 
where r is the distance from the centre of the sphere and q = 4 a%o 


(0 is the charge density and a is the radius of the sphere). 

To determine the field inside the sphere, consider an auxiliary 
sphere of radius r< a. The quantity of electricity inside this sphere 
is less than q, being equal to 

4 3 r3 
ag wr p= ae q- 
According to Gauss’s theorem 


3 
D x 4nr? =; gq, 


i.e., 
pas emer 
2 anae 
Hence, the electric field intensity is 
Geaene: Dene 
E= Ase gea3 M 


or in Gaussian units: 
q 
=—or. 
E ea3 


It should be noted that the field is equal to zero only at the centre 
of the sphere. Then, as shown in Fig. 92, the field increases linearly 


and becomes equal to Ay at the surface 


of the sphere (r = a). Here, the for- 
mula for the field outside the sphere 
and the formula for the field inside 
the sphere yield the same result. From 
this radius outward, the field de- 
creases in accordance with an inverse 
square relationship. The potential 
r outside such a sphere is again given 


é 


by <. Inside the sphere, the value of 


Fig. 92 ~ does not interest us and will not 
be considered. 
Cylindrically Radial Field. Let us consider the field created by 


a uniformly charged line or cylinder having a charge + per unit 


length. Outside the charged region, the fields of such systems are 
the same and have the following form: the lines of force are at right 
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angles to the axis of symmetry and the flux is the same in all radial 
directions. 

In order to apply Gauss’s theorem about the cylindrical line, let 
us consider an auxiliary cylindrical surface of radius r and unit 
height. Since the flux passes only through the lateral surface of this 


cylinder, § ®D cosa dS. is equal to the integral over the lateral surface. 


Owing to symmetry (cos a = 4 and ® is the same at all points of 
the cylinder), 


2 as=4, 
i.e., 
ms 
® x 2ar =1 and P=. 
attr 
Hence, the field intensity is 


E= 


4 
l 
2Meger 

or, in Gaussian units, 
gf 
l 


tr 


Thus, the field of the cylinder is inversely proportional to the 
distance. This formula is equally valid for the region about a charged 
line, the region outside a charged cylinder and the region between 
the conductors of a cylindrical condenser. 

Since dp = —Edr, we obtain for the potential: 


_24q 1 
@ rae! = + const. 


The potential decreases much slower with increasing distance than 
in the case of spherical systems. Thus, for example, when the distance 
ry is increased to 10 times its original value, the potential decreases 


to a of its value rather than to - 
For a cylindrical condenser, where the radii of the cylinders are 
q and 0, we obtain 


2 2 a 
(o—Pa=+ 4 (Ina—Inb)==4tins. 
The capacitance per unit length of such a condenser is 


ee: 
2In= 
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Example. For a coaxial cable having an outer radius a = 18 mm and an 
inner radius 6 = 6 mm, and filled with an insulator of relative permittivity 
€ = 4.2, the capacitance per unit length is C = 1.91 cm/em = 2.12 pf/cm. 


It should be noted that the formulas derived above do not take 
account of field distortion at the ends of the cylinder and hence, 
strictly speaking, are valid only for infinitely long cylinders. Prac- 
tically, however, the derived formulas are valid if the region of 
ered field is significantly smaller than the undistorted radi- 
al field. 

Uniform Fields. Uniform fields, i.e., fields in which the lines of 
force are parallel and evenly spaced, are created by charges in planes 
of infinite extent. Naturally, the flux is perpendicular to such planes. 
The magnitude of the field is again determined by means of the 
Gauss-Ostrogradsky theorem. Thus, let us consider an auxiliary 
surface in the form of a cylinder passing through a plane of charge. 
If the lateral surface of the cylinder is perpendicular to the plane, 
the flux through the auxiliary surface is equal to the flux through 


the two end surfaces of the cylinder. The integral §D cos adS is 


then equal to 2S, where S is the area of the cylinder base. The 
charge within the cylinder is equal to oS. Hence, the formula for 
the displacement becomes 
oO 
=o es 
The electric field intensity is E = os and in Gaussian units E = ig 
; ‘ 0 : 
We see that the intensity does not depend on the distance to the 
sources of the field. Let us now consider a parallel-plate condenser. 
The field inside spherical and cylindrical condensers is created only 
by the inner surface of charge, while in a parallel-plate condenser the 
field between the plates is created by both surfaces of charge. Just 
as in the case of the condensers considered above, there will be no 
field outside the condenser. Between the condenser plates ® = o and 
the intensity in Gaussian units is 


4 
E=—~o 
€& 


In writing the expression for the potential of a uniform field, let 
us reckon the distance z from one of the charged plates in the 
direction of the lines of force. Thus, in the case of a single plate, ths 


potential is written in the form p= — sa ox + const. In the case 


of a condenser, the expression for the potential between the plates 
becomes 


4n 
Q=—- or + const. 
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Hence, the potential difference is 
4x An 
fa —fo == 9 (2p — Xa) = = od, 


where d is the distance between the plates. Thus, the capacitance 
per unit area of a parallel-plate condenser is 
C=z5 or, in practical units, a 

The above formulas are exact only for plates of infinite extent. 
In practice, they may be employed if the effect of the condenser edges, 
where the nonuniformity of the field is pronounced, is not 
great. We can determine the field at some point by means of the 
derived formulas only if this point is sufficiently far from the edges. 
More specifically, this condition means that the field created by 
elementary charges located at the edges of the plates should be 
much less than the field created in the neighbourhood of the point 
under consideration. 


Example. Let us return to the condenser considered on p. 232. The distance 
between the plates will be doubled using two different methods: 
Method 1: The plates remain connected to a source of voltage U = 5 kv. 
Then, .Cy= 298 =18 pl, = CU =9x 10-9 coul, £)= = 
1 4 
= 10% volts/metre, D, = 9 X 107® coul/metre? and N;, = 9 X 1079 coul. 
After doubling the distance between the plates, we obtain: Co = 0.9 pf, 
qo = 4.5 X 107% coul, Ey = 0.5 X 10% volts/metre, D2 = 4.5 X 1078 coul/me- 
tre? and N. = 4.5 X 1079 coul. Thus, half the charge entered the source. 
Method 2: Before doubling the distance, let us disconnect the plates from 
the source (condenser charge q const). Cy = 0.9 pf, q= q; = 9 X 1079 coul, 
: U 
Ug = 7 = 10 kv, Ey = _ = E;, 2o= DB, and No = Ny. Thus, the 
2 2 
vollage on the plates doubled at the expense of the work of external forces. 


Field on the Surface of a Metal Object. There is no electric field 
inside a metal object. This follows from the fact that all of a con- 
ductor’s charge is located on its surface. According to Gauss’s theo- 
rem, the field is directed outwardly. 

The surface of a metal object is obviously an equipotential sur- 
face, for otherwise the electric charges would redistribute them- 
selves on thesurface of the conductor. It follows, therefore, that the 
lines of flux leaving the surface of the metal object must be perpen- 
dicular to the surface. Since all the flux leaves the surface in a sin- 
gle direction, then, according to Gauss’s theorem, ®D = o lines 


emerge from unit surface area. In other words, the field intensity at 
: : : : 40 
the surface of the conductor is equal, in Gaussian units, to E = a 


Electric Images. Let us consider the electric field created when 
a point charge is placed near a plane metallic surface. Due to elec- 


16—1409 


tric induction, an electric charge of opposite sign accumulates on 
the surface of the metal near the point source. The density of the in- 
duced charge is greatest directly opposite the point source and de- 
creases to zero at infinity. Similarly, for the electric field. 

Let us now consider this problem quantitatively. Since the sur- 
face of a conductor is an equipotential, the conducting surface may 
be considered grounded without in any way reducing the generalily 
of the problem. Hence, the potential of the metal plate is equal 


y 


QW 


a) b) 
Fig. 93 


to zero and there is no field inside the plate. We are interested in 
the clectric field in the right half-space. The electrical properties of 
this half-space are uniquely determined if the magnitude of the 
charge and its distance from the equipotential plane are given. It is 
important to note that it is entirely immaterial what is located to 
the left of the zero-potential surface. It is proved rigorously ‘in 
courses on mathematical physics that the field in a particular 
region is uniquely determined if the charge in this region and the 
boundary conditions for the potential are given. 

In Fig. 93a, the field is plotted for two charges of equal sign. Now, 
consider the space of this field to be divided into two symmetrical 
parts. The half-space of this figure is then exactly equivalent to the 
half-space of a charge near a metal plate (Fig. 93b) and the fields of 
such half-spaces should be identical. This is the basis for the follow- 
ing procedure, known as the method of images. We “reflect” the 
electric charge in the surface of the metal plate. In the right half- 
space, the electric field of the charge and its “image” should coincide 
with the unknown field. Thus, the unknown electric field is expressed 
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by the formula 


where 7; is the distance of the observed point from the charge and r, 
is the distance of this point from the charge’s image. 

The second conclusion to be drawn is that the electric charge is 
altracted to the surface of the metal plate wilh the same force as to 


its electric image, i.e., the force of altraction isa , where a is the 


distance of the charge from the surface. 

Finally, this approach to the problem enables us to determine the 
distribution of the induced electric charge on the surface of the 
metal plate. This requires that we differentiate the expression for 
the potential in the direction normal to the surface. We obtain 
thereby the electric field intensity £, which in accordance with the 
formula given in the preceding section of this article must be multi- 


plied by = to obtain the charge. 

The method of clectric images has numerous applications and 
enables us to solve electrostatic problems involving systems of 
nonplanar conductors with point charges located in the vicinily 
of the conductors. 


90. Electric Energy 


Energy of a Condenser. It is easily demonstrated that a charged 
electric condenser possesses energy. Moreover, the measurement of 
the magnitude of this energy is not difficult. Thus, for example, we 
can discharge a condenser through a conductor and measure the ther- 
mal energy released thereby. However, we need not resort to experi- 
ment to determine the factors on which the electric energy of a con- 
denser depends. The formula for this energy follows directly from 
familiar theoretical propositions. 

To simplify this discussion, let us consider a condenser in which 
one of the conductors is grounded. The process of discharging the 
condenser (grounding the second conductor), which is charged to 
a potential difference p by a quantity of electricity g, may be viewed 
as the successive outflow to ground of elementary charges dq under 
the action of electric field forces. Therefore, the work performed by 
the field in this simple process is equal to pdg. As the discharging 
process proceeds, the work performed in transferring each successive 
quantity of charge to ground becomes less and less, for the potential 


difference ¢ = is constantly decreasing. The total work performed 
7 Pp 


C 
16* 
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by the field during condenser discharge is 
q 


q7,_ 9 
\ 7 4a= 3c - 


This quantity is quite understandably referred to as the electric 
energy of the condenser. Using the relationship between potential 
and charge, we obtain for the energy: 

Wrote = 


— — 5) 


2 eee Tl 2 2 


Thus, for constant potential difference, the electric energy is propor- 
tional to the charge squared. A constant difference of potential is 
maintained when the condenser is connected to a constant source. 
Moreover, if the condenser conductors are insulated, the charge is 
constant. Then, the energy is proportional to the potential squared 
and directly proportional to the capacitance of the condenser. 
Energy of a Field. In the case of a parallel-plate condenser of infi- 


nite extent, the energy formula may be written in the form We; = or 


; : : Ep ‘ . : 
when using MKS units or in the form We; = Tr when using Gaussi- 


an units. These formulas give the energy per unit condenser area. 

The energy formulas may be written in terms of intensity E rather 
than potential difference @. Making the substitution p = Ed, we 
obtain 


£9eL2 ek2 
We= 5 d, or Wer =—=—d. 
Thus, the enet uni ig. Ot eee 
; gy per unit volume is ae Let us call 2 the 


electric energy density. 

Now, let us consider an arbitrary electric field. Assume that the 
equipotential surfaces and lines of force are plotted and that the space 
is divided into small volumes dv, each of which is bounded by two 
adjacent equipotential surfaces and a lateral surface passing through 
lines of force. Each of these volumes is like a small volume in a par- 
allel-plate condenser and, hence, the electric energy associated with 


: E2 i . ih oe 
such an element is dW = — dv. If this expression is integrated over 


the entire volume occupied by the electric field, the formula obtained 
yields the electric energy of the system creating the field. 
Thus, the formula for the electric energy has the form 
2 
Wer = \ Sav. 
The significance of the above mathematical transformations goes 
beyond the formal convenience of using one or another formula. 
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This new expression for the energy enables us to speak not only of 
the energy of the system creating the field, but of the energy of the 
electric field ilself, and leads to the concept of a real electric field. 
In the case of constant fields, this conception can neither be con- 
firmed nor refuted. However, in the case of varying fields, we find 
direct evidence for the existence of electromagnetic fields (see p. 321). 
Hence, the derived formula for the energy of a field (energy of elec- 
tromagnetic matter) is of fundamental significance. 


Example. Let us continue with the example considered on p. 241. Before 
the plates were moved, the energy stored in the electric field of the condenser 
was W, = 22.5 % 107° joule and the energy density w, =4.5 joules/metre®. 
After the plates are moved by the first method (voltage U' = const), the ener- 


gy becomes W. = — = 11.25 x t07® joule and the energy density we = 


= 1.12 joules/metre? (the volume of the field doubled). The energy of the source 
increases at the expense of the work of external forces and a decrease in the 
energy of the field. After the plates are moved by the second method (g = const), 
the energy W. = 2W, = 45 X 107° joule and the energy density does not 
change, i.e., We = 4.5 joules/metre?. 


Energy of Interaction. When two oppositely charged bodies draw 
together, the work performed by the forces of the electric field is, 
naturally, at the expense of the energy of the electric field: dA = 
= — dW,,. Thus, as indicated on p. 54, the work of the electric 
forces is performed at the expense of a decrease in the potential 


energy uae This energy is appropriately called the interaction 


energy of the charges. 

What is the relation of this formula to the formula for the elec- 
tric field energy considered above? It is evident that the interaction 
energy is a part of the clectric field energy of the charges under con- 
sideration. Carefully examining the formula for the field energy, 
we note that the electric energy has definite meaning even when there 
is only one electric charge in the region. The energy of the field cre- 
ated by a single charged body is appropriately called the self-energy 
of the electric charge. We can always resolve the electric field energy 
into the self-energies of the individual electric charges and the 
interaction energies of these charges. 

Let us designate by E,, Eo, ... the field intensity created by the 
first, second, etc., charges. The total field is equal at each point in 
space to the vector sum of the intensities: FH=E,+H,+-:-. 

The electric energy density is 


E(B: t+ Et+...P= 
pit i eR ee Rd te: 


4n 
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Clearly, the individual terms of this expansion correspond to the 
energy components discussed above. Thus, the Z-squared terms 
yield the self-energies and the terms involving the product of two 
different intensities yield-the interaction energies. The interaction 
energies of the charges may be positive or negative quantities. On 
the other hand, the self-energies of the charges and the total energy 
of the field must be positive. 

As a rule, only the interaction energies of electric charges are in- 
volved in a problem. We can calculate, therefore, the work of elec- 
tric forces by determining the decrease in the energy of the field. 
The easier calculation is the one that should be performed. 


91. Electron Radius and the Limitations of 
Classical Electrodynamics 


Let us calculate the self-energy of a spherical charge, assuming 
the electricity to be distributed on the surface. The electric field 
is then only outside the charge. Therefore, the energy of the field 
must be integrated over the region external to the sphere. If the 
charge is in a vacuum, the field intensity is expressed by the formula 


a and the energy density at any point in the region has the form 
4 


Sat 
The energy contained in such a shell, whose inner radius is r and 


2 : ’ ae : ‘ 
+ . Consider the entire region to be divided into spherical layers. 


8 


outer radius r-+dr, is = - X vol. of shell. Since the volume of 
the shell isequal lo 4r? dr, the energy in this spherical layer is given 


, : 2 
by the simple expression 5 ar. To determine the total energy of 
the field, this expression must be integrated from a (radius of the 


spherical charge) to infinity. Thus, 


ae is the form of the energy formula for an electrically charged 
sphere. 

We shall leave it to the reader to show that if the charge is dis- 
tributed throughout the volume of the sphere the energy formula 
obtained is almost the same. The only difference is that in this case 
the formula contains a coefficient close to unity. 

What is the result if the above formula is applied to an elementary 
particle, e.g., an electron? 

According to the principle of relativity (see p. 423), the internal 
energy of a body of mass m is given by the expression mc*, where 
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c is a universal constant equal to the propagation velocity of electro- 
magnetic waves in vacuum. Equating the two energy expressions, 
we obLain the formula for the electron radius: 


q? 
a= Dme2 rs 
Substituting numerical values* in this interesting formula, we 
obtain a = 1.4 x 10-8 cm. There is considerable indirect evidence 
in physics that the order of magnitude of the electron radius deter- 
mined in this manner is quite correct. 

Nevertheless, the conception of an electron as a “usual” electric 
particle is clearly false, for we are immediately confronted with the 
problem of the forces holding the component parts of an electron so 
close together. We know that the forces of repulsion between electric 
particles separated by a distance of the order of 10-! cm is tre- 
mendous. : 

Furthermore, there are other theoretical difficulties. Thus, it 
follows from the theory of relativity that an electron should be 
a mathematical point. At the same time, the electric energy of 
a charge concentrated at a point is infinitely great. 

These difficulties are typical for so-called classical physics, which 
developed in the main in the 19th century. Classical physics excel- 
lently describes the behaviour of macroscopic bodies. In fact, by the 
turn of the century many scientists believed that classical physics 
was already so perfected that there was little left to be discovered 
in physics. After the discovery of elementary particles, it was natur- 
al to try to apply the laws established for large bodies to elementary 
particles. This is when classical physics began to “fail”. We now 
know that concepts derived from observations on macroscopic 
systems cannot be simply transferred to atoms, nuclei and electrons. 

The electron problem cannot be solved within the framework of 
classical conceptions. Considerable success has been achieved in elec- 
tron theory during recent years, but a complete theory does not 
exist. Therefore, the classical theory of electricity (electrodynamics) 
presented in this part of the book has certain limitations. These 
are encountered in studying the interaction of elementary particles. 
When dealing with the behaviour of a single elementary particle in 
fields created by large bodies and, of course, when considering the 
interaction of macroscopic bodies, one obtains, using classical elec- 
trodynamics, results that are in complete agreement with experi- 
mental data. 


* q= 1.601 * 107!9  coul; m = 9.1066 x 10728 gm; and c= 


= 2.99776 x 1910 S™ 
sec 


iy) 
~ 
ee} 
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92. Electric Forces 


In calculating interaction forces between charged bodies, one often 
uses the concept of an electric force field. Instead of saying thal 
body A exerts a certain force on body B, we introduce a force field 
and say that body A creates a field and this field acts on body B. 
As we shall see in Chapter XVI, this field is more than an abstraction. 
An electromagnetic field is a physical reality and nalure implements 
the interaction transmitted from one point in space Lo another (“short 
range” action). By introducing the field concept, we can ignore the 
field sources and determine the forces acting on a charged body 
knowing only the field intensities where the charges of the system 
under consideration are located. : 

Every charged body is a system of charges. In the case of a system 
of discrete charges, the force acting on the system is I” = q,Ey-- 
+QoHo+ .... Here B,, Ho, ... are the field intensities where the 
charges are located. When the electric charge is uniformly distributed 
throughout a volume, the force acting on the body may be repre- 


sented by the following integral: 7 = . Eo dv. If the electric charge 
is distributed over a surface, the force is represented by a surface 
integral: F = \ Eods. 


However, one precaution must be taken when the force is deter- 
mined directly in this manner, namely, the value of the intensity used 
in the formulas must be the intensity existing in the absence of the 
charge on which the force is acting. In the formulas in which the force 
is expressed as a sum, the action of charge q; on itself does not enter 
into intensity £;, i.e., in calculating FE, the field created by qi is 
not considered, etc. The same is true for the integral formulas, i.e., 
the field intensity under the integral sign is the intensity created 
by the entire distribution of electric charge, except for the quan- 
tity of electricity located at the point under consideration. 

Let us illustrate this by means of the force acting on a charged met- 
al surface. As we know, the electric field intensity on the surface of 
a metal bounded by a dielectric is equal on the dielectric side to 
4n0 . , . F 2 
— in Gaussian units, and on the metal side is equal to zero. The 
field intensity is broken on this surface. To determine the force acting 
on an element of surface, we must multiply the quantity of electric- 
ity odS by the intensity which would exist at this location if the 
element of charged surface under consideration were removed. There- 


: Ar 
fore, it would not be correct to multiply odS either by << , the 


value of the field on the dielectric side, or by zero, the value of 
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the field on the metal side. It can be shown rigorously that the 
field existing at this location after removing the element under con- 
ANO 


sideration is equal to the arithmetic mean of 0 and ——, i.e., is 
210 : 
equal to = . Thus, the formula for the force acting on an element 
of a conducting body’s charged surface has the form 
2no02 
dS, 
& 


and for the entire body ; 
F=2n \ — ds. 


The integration in the above formula must be performed over the 
enlire surface, taking into consideration differences in charge den- 
sity and_ dielectric con- 
stant along the metal sur- 
face, 

In the case of a uniform 
field (ideally, between the 
plates of a condenser of in- 
finite extent), the force F 
acting on the plate area S 


may be determined with- + 
considerable accuracy by 


the formula 
It02 Fig. 94 
F= —— Ss 


The magnitude of this force may be measured by means of a Thomson _bal- 
ance, whose method of operation is illustrated in Fig. 94. When the potential 
difference between condenser plates of 50 cm? area is 600 volts and the distance 
between plates is 5 mm, the force of attraction between the plates, calculated 
pe the two systems of units thal we have been using, may be determincd_as 
ollows: 


Gaussian Units MKS Units 
eS e981 10-9 5K 10-3 
eee Sia Cg = Sen SXF = 
g=CU=8 X2=16 statcoulombs =8.9 x 10-12f 
o = 0.32 statfarad/cm?2 q=CU=8.9 x 10712 x 600 = 
2102 . 2X 3.14 x 0.322 =5.3 x 10° coul 
po gaa x 6 = 1.06 x 10-8 coul/metre? 
o2 
* 50 = 32 dynes EI pee S= 


_ (1.06 x 10-62 x5 x 1073 


, | 108 a 
2x1| ae | 


= 32x 10-5 newton 
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Thus, in order to balance the force of electrostatic attraction, one must place 
a 33.3 mg weight in the opposite pan. 

It is still more difficult to determine the force acting on a body 
having a distribution of volume charge. Here, in the expression 
o£ du, the intensity His the intensity of the field created by all the 
charges except pdv. 

If the charged body is in a dielectric medium, calculation of the 
force is complicated by the fact that when we consider the charge to 
be removed it is also necessary to consider a corresponding portion 
of the dielectric removed, which means the polarised state changes 
(see below). 

If we wish to avoid the difficulties connected with “subtracting” 
the effect of the charge on itself, the force must be determined from 
the energy expression. The decrease in energy is equal to the work. 
Then, if we know the magnitude of the displacement, we can deter- 
mine the value of the force. As a rule, this is precisely the method 
used in determining the force. 

Calculation by this method of the force acting on the plate of 
a parallel plate condenser, F = _ S, may serve asa vivid illustra- 
tion of the above. Observing the attraction between the plates of 
the condenser (disconnected from a source of voltage), we can imme- 
diately write the expression for the change in energy when the 
plates come together by an amount A: 


SA se eal laas 
af & 


Hence, the force sought is 


fa SO gy. 
& 


93. Dipole Moment of a System of Charges 


Let us return to electrical systems that may be represented as sys- 
tems of point charges. Assume that the electric field is uniform in 
the region of the system of charges under consideration. Then, the 
formula for the force acting on the system has the form 


F=(y+@+...) B=QE, 


where Q is the total charge of the system. If a body is electrically neu- 
tral, as in the case of an atom or a molecule, the force acting on the 
body, which contains equal quantities of positive and negative parti- 
cles, is equal to zero. Does this mean that an electrically neutral body 
does not interact with the electric field? It is easily seen that the 
answer is no. In a uniform field, the forces acting on the charges of the 
system are parallel to each other. We can determine the resultant of 
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the forces acting on the positive charges and the resultant of the 
forces acting on the negative charges. As is well known, the resultant 
of parallel forces is exerted at the centre of “gravity” of a body. 
Since we are dealing here with the electric centre of gravity, the 
word “gravity” has been placed in quo- 

lation marks. Thus, all the forces acting ——————_______» 
on the charges of a system located in a 
uniform field may be reduced to two 
antiparallel forces. One of the forces is 
applied at the centre of gravily of the 
positive charges and the other at the 
centre of gravity of the negative charges 
(Fig. 95). If the system is electri- 
cally neutral, the two forces’ are equal 
and the total force is zero. However, a 
couple of forces of moment WW=qEi sin a 
will act onthe system of charges if the 
centres of “gravity” of the positive and 
negative charges are displaced with re- 
spect to each other. 

The vector 72 = gl; equal in magnitude to the product of the 
positive charge of the system and the distance between the centres 
of gravity, is called the dipole moment of the system. It is considered 
to be directed from the negative centre to the positive centre. The 
dipole moment of a system determines its behaviour in a uniform 
field. Thus, a system left to 
itself in a uniform electric 
field tends to turn until the 
dipole moment is parallel to 
the direction of the electric 
field (sin @ = Q). 

In a uniform field, the entire 
effect on a neutral system of 

Fig. 96 electric charges is reduced to 

the moment of force M = pE 

sin «, where p is the dipole moment of the system and is equal to 

the product of the quantity of electricity of one sign and the dis- 

tance between the dipole charges. Thus, in a uniform field, there is 

no need to consider the complex distribution of a particular 

system of charges. It suffices to replace it by the corresponding 
dipole. ‘ 

If the system is located in a nonuniform field, the dipole moment 
can no longer completely describe its behaviour. This is illustrated 
in Fig. 96 where four charges, located at the corners of a square, 
comprise an electrically neutral system having a dipole moment 


Fig. 95 
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equal to zero. This is because the centres of gravily of Lhe negalive 
and positive charges coincide. In a uniform field, neither a force 
nor a moment of force acts on such a system. In a nonuniform 
field, however, it may undergo translatory as well as rotational 
motion, since generally speaking the forces acting on the charges 
differ. By analogy with the dipole, such a system of four charges is 
called a quadrupole. Anoth- 
er neulral system having 
zero dipole moment, called 
an octupole, is also shown 
in Fig. 96. 

Of great importance in 
r the study of the structure 
of matter, to which we 
shall devote a great deal 
of attention later, is the 
consideration of the inter- 
action of simple electric systems. Let us consider several such 
systems. 

Charge-Charge. The interaction between two point charges is, in 
accordance with Coulomb’s law, F = ie : 


Charge-Dipole. A dipole left to itself tends to turn until it is par- 
allel to the lines of force. After it has turned, the dipole remains 
stationary in a uniform field, while in a nonuniform field it will be 
drawn toward the region of greater field intensity (Fig. 97). If the 
nonuniform field is the field of a point charge, the dipole will be 
drawn toward this charge. This force of attraction is 


Be@\ ean)? 


Assuming the distance between the dipole charges. to he small, and 
reducing the expression to a common denominator, we obtain the fol- 
lowing interesting formula if we neglect the quantity J? relative 
to rl and the quantity rl relative to r?: 


/ 


Fig. 97 


Note that the force of interaction between the charge and the dipole 
decreases more rapidly with increasing distance than the Coulomb 
force, i.e., it is inversely proportional to the distance cubed. 


Example. The distance between the H atom and the Cl atom in an HCl 


molecule is equal to 1.28 A, and the dipole moment of the molecule is p = 
= 6 X 10718 statcoul cm. Therefore, an electron located at a distance r= 


= 10 A from the molecule is attracted to it with a force of ~ 6 x 16-6 dyne. 
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Dipole-Dipole. Here, it is convenient to solve the problem for 
the two cases in which the dipoles are arranged as shown in Fig. 98. 
The exact interaction formulas have the form: 


Fs -- = === “(for arrangement (a) 


i. SR 
Pr: Boar for arrangement (b). 


If the distance between the charges of a dipole is small the above 
formulas may be replaced by the 


following approximate expressions: 
. dp? a 
F= ae for arrangement (a) r ~ 


Gp2 
F= — for arrangement ()). 


j}x— ¢ 
Thus, the interaction forces are 9) 
inversely proportional to the | z 
fourth power of the distance. 


Example. Two HCl molecules that 
are 10 A apart are attracted with a force 
F ~ 107 dyne in the case of arrangement (a) and with a force F ~ 
~ 2% 407-6 dyne in the case of arrangement (b). 

Charge-Quadrupole. Assume 
a that the orientation is as 
shown in Fig. 99. The interac- 
tion force may then be 
written in the form 


Fig. 98 


rea? r ) 
: 


P= 2900 Gama apa 
and the approximate formula 
for a small quadrupole is 
2 . 
t pa ie . Thus, the force is 


Fig. 99 inversely proportional to the 
fourth power of the distance. 


94. Polarisation of an Isotropic Dielectric 


As we know, when a region containing an electric field created by 
a system of charges is filled with a homogeneous dielectric, the 
field intensity andthe magnitude of the electric potential are de- 
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creased toe of their original values. On the other hand, the electric 


displacement and induction remain unchanged, and the capacitance 
of a condenser increases to € times its original value. The latter 
effect is often utilised in the measurement of dielectric constants. 
Thus, the ratio of the capacitance of a condenser containing a di- 
electric between its plates to the capacitance of the same condenser 
without dielectric may be used as a definition of dielectric constant. 

Let us now go a step further and inquire into the reasons why the 
dielectric affects the electric field. The following experiment sug- 
gests the explanation for this phenomenon. 

Consider a parallel-plate condenser connected toa source of volt- 
age. The electric charge density on Lhe condenser plates and, hence, 
the number of D-lines per unit area are uniquely determined by the 


electric field intensity, i.e., c=. Let us now fill this condenser 


with a homogeneous dielectric. The relation between the electric 
field intensity and the charge density on the condenser plates is then 
expressed by the equation c= , i.e., the flux density (or D-lines) 
increases. In this experiment, tle electric field intensity cannot change, 
for it is equal to the potential difference divided by the distance 
between the plates. Therefore, the charge density on the condenser 
plates changes, i.e., it increases to ¢ times its original value. This 
increase may be observed experimentally. Thus, as we fill the con- 
denser with the dielectric, the voltage source adds charge to the 
condenser. By measuring the electric current and the time of flow, 


one can show that the quantity of electricity added per unit con- 
denser area is 


eE E _ e—1 
An 4m AN 


E. 


As we remove this dielectric, the additional charge returns to the 
source and the additional force lines disappear. To explain the 
additional attraclion of charge to the condenser plates, one must 


assume that charges of opposite sign, having a density c= Be 


are formed on the dielectric surface next to the condenser plates. 

The surface charge of the dielectric may be explained if we assume 
that the dielectric consists of bound pairs of positive and negative 
charges that cannot move through the body, but can move relative 
to each other, forming thereby a dipole moment in each unit volume 
of the dielectric. This transformation of an electrically neutral 
system of charges into a system having a dipole moment is called 
polarisation, and the dipole moment vector of a unit volume of 
dielectric is called the polarisation vector P. 
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Polarisation of a dielectric does not produce volume charge. The 
numbers of posilive and negative charges per unit volume remain 
equal to each other after displacement. However, polarisation does 
produce a charge on the dielectric surface. This is illustrated sche- 
matically in Fig. 100. The density of this charge is equal to the 
value determined above, i.e.: 


& 


—i1 mY 
OpoL = aS E. 


In our discussion, we have considered a dielectric adjacent to the 


plate of a parallel-plate condenser. However, the situation is the 
same for a conductor surface 


of any shape. Moreover, it 

turns out that the above lisbalalaldalrdititilatans 
expression for Opoz is of gen- 

eral validity for surfaces l i (l K (| (] ll fl (] ( l l i 


perpendicular to lines of 
FETT THT PA PT PEPE PP 


force. Thus, in every case, 


e—l 
eee EEY 2 
44 7" 


where E, is the projection 
of the intensity on the nor- Fig. 100 
mal to the surface. This for- 
mula is applicable to any real or imaginary boundary in a dielectric. 
A polarised charge (also often called a bound charge) may be 
expressed in terms of the dipole moment of a unit volume. When 
a field is applied in the case of isotropic bodies, the displacement of 
the bound charges occurs along the electric lines of force. Therefore. 
the polarisation vector is parallel to the intensity vector. From 
a dielectric plate, let us cut out a cylindrical rod having a base S 
and a length 2. Owing to polarisation, equal and opposite bound 
charges will accumulate at the ends of the cylinder. The dipole mo- 
ment of the rod is equal, by definition, to the product of the charge 
oS and the distance / separating the charges of the dipole, i.e., 
P = Opoz SI. The dipole moment of a unit volume is |P| = opor. 
It has been assumed in this calculation that the base of the cylin- 
der is perpendicular to the polarisation direction. If the base is 
inclined at an angle a with respect to this position, the charge densi- 
ty on the ends of the rod will decrease as the cosine of the inclination 
angle. Thus, in the general case, the following relationship holds: 


Opor =Pn, where P, =P cosa. 


We are now able to establish a relationship between the polari- 
sation vector and the electric field intensity vector. Combining the 
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last formula with the expression for the density of bound charges 
discussed at the beginning of this article, we obtain for any direc- 
tion nm: 

P, =" En. 
Thus, if the dielectric constant does not depend on the intensity, 
a linear dependency exists between the vectors P and E: 


P=cak. 


The expression « = £—! is usually called the electric susceptibil- 
Pp in y 


ity. For water « = 6.38 and for glass a = 0.48. 
Since D = cL, the relationship between the vectors D, # and P 
may be expressed in the form 


D=E+4aP. 


When the medium is homogeneous, the vectors D, E and P are 
parallel. 


95. Polarisation of Crystal Substances 


Hitherto we have considered the behaviour of a substance that is 
characteristic of an amorphous or finely crystalline body, or of 
a monocrystal in certain special orientations relative to the field. 
However, if a plate is cut from a monocrystal at an arbitrary angle 
to the crystal faces and if the plate is then placed betweén the con- 
ductors of a condenser, the following effect may be observed: The 
plate becomes polarised perpendicular as well as parallel to the 
lines of force, so that P is not parallel to #. Therefore, in this 
case, D is also not parallel to the field intensity. 

In monocrystals, the inclination direction of a free electric charge 
(E) does not coincide with the direction of the normal to the surface 
oriented in such a manner that a maximum charge (D) is induced on 
it. The relationship between D and EF becomes more complex and 
in order to find D in terms of #, or vice versa, it is insufficient to 
merely know the dielectric constant. In any monocrystal, three 
directions (main axes) in which J || Z may be found. If we know 
é for these three directions, the relation between D and Hi for any 
arbitrary orientation of the crystal in a field may then be established. 
How are the vectors D, £ and P related in this case? It turns out 
that the equation P = £ + 4nP, introduced in the previous 
article for the case of parallel vectors, is also valid when the vectors 
cease to be parallel. There is another difference between crystals 
and amorphous bodies in regard to their dielectric properties, 
namely, a relatively small class of bodies belonging to crystalline 
substances possess hysteretic properties. Since these properties 
were first discovered in Seignette (Rochelle) salt, such substances 
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are called seignettoelectric materials. Their characteristics will 
now be discussed. 

Place a seignettoelectric material (for simplicity assume that 
we are dealing witha powder or crystal oriented in the field so that 
D || £) between the conductors of a condenser. Let us vary the 
voltage between the condenser conductors, and hence the field 


intensity E =<, and measure the charge density o on the con- 


denser conductors, which for D || # yields the magnitude of the 
electric induction D. The magnitude of the induction D increases 
as E increases, but it is not directly pro- 

portional to EZ, for D begins to increase D 

less until, finally, saturation sels in. 

Clearly, saturation of D corresponds to 

saturation of polarisation. Let us now 

begin to decrease the emf between the 

conductors. Displacement and_ polari- 

sation begin to decrease and the curve 

follows a downward path, but not the = 
same one taken during the period of rise. 

As a result, when the emf is completely 

removed (£ =0), the induction and 

polarisation in the dielectric are not 

equal to zero. The dielectric becomes 

similar to a permanent magnet. It will 

have “north” and “south” electric poles 

and will behave like a large perma- Fig. 101 

nent dipole. 

The subsequent behaviour of seignettoelectric materials is evident 
from the hysteresis loop shown in Fig. 101. To “de-electrify” the 
dielectric, the polarity of the emf on the condenser conductors must 
first be reversed. Then, by increasing & in this new direction, 
we can depolarise the dielectric. A further increase in E& again 
electrifies it, but with opposite polarity Finally, saturation sets in 
again and the process may be repeated in the reverse direction. 

Why is this effect called hysteresis? The word is derived from the 
Greek and means “to lag”. The loop illustrated in the figure shows 
that the values of D, as well as of P and e, depend on the past state 
of the sample, i.e., on its history. ; 

Every crystal that does not possess a centre of symmetry in a 
number of its elements of symmetry (see p. 590) possesses an inter- 
esting property, namely, its dimensions change upon application of 
an electric field. This phenomenon is known as electrostriction. 

Thermodynamic considerations show that if an electric field pro- 
duces a deformation, the deformation in turn will produce polarisa- 
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tion. This is known as the piezoelectric effect. Applications of the 
piezoelectric effect were briefly discussed in Part I. The relation of 
this effect to the structure of matter will be discussed on p. 657. 


96. Finite Dielectric Bodies in an Electric Field 


The following questions may arise regarding a finite nonconducting 
body located in an electric field. What forces and moments of force 
act on such a body? How is the field distorted by the presence of the 
dielectric? 

A dielectric body placed in a field becomes polarised and acquires 
a certain dipole moment. Therefore, the behaviour of such a body 
in an electric field does not, generally speaking, differ from the 
behaviour of a dipole. If the polarisation vector is directed at an 
angle to the field intensity, the 
orientation of the dielectric will 
be unstable. A moment of force 
will act on the body, which will 
tend to turn the body until the 
vectors P and E are parallel. 

Thus, a _ dielectric body 
placed ina given uniform electric 
field assumes a definite equi- 
librium orientation that de- 
pends on the form of the body 
Let us illustrate this for the case 
of a dielectric bar. 

Experiments show that the 

Fig. 102 equilibrium orientation is the 

one for which the longitudinal 

axis is parallel to the lines of force. Why is this so? Is it not true 

that the bar does not have fixed poles? Fig.102 illustrates the reason 

for this peculiar behaviour. The forces acting on the bound charges 

of the depicted rectangular bar-may be reduced to four forces acting 

on four surfaces of the bar. We see that the forces acting on the 

longitudinal surfaces almost balance each other, while the forces 

acting on the lateral surfaces form a couple of forces that orients the 
bar parallel to the lines of force. 

If the body is in a nonuniform field, then, in addition to the mo- 
ment of force, there will exist forces tending to pull the dielectric 
toward the region of greater field intensity. This phenomenon may be 
vividly demonstrated by making a dielectric fluid rise in a tube upon 
applying voltage to a condenser. The forces making bits of paper 
cling to a glass or ebonite rod rubbed with fur or leather are of the 
same kind as those acting on a dipole in a nonuniform field. 


F444 
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Let us now turn to the question relating to the distortion of an 
electric field due to the presence of a dielectric body. First, we shall 
show that the general laws for an electric field lead to an important 
relation between the values of the electric field on either side of the 
boundary between dielectrics. 

The electric field intensity vectors at two neighbouring points 
located on opposite sides of the boundary between dielectrics having 
permittivities ¢, and ¢€. differ E,< Ep 
in magnitude as well as in u=E \ \ 
direction. Let us_ resolve re 
these vectors into compo- 
nents parallel and normal 
to the boundary. It can be 
asserted that the field par- 
allel to the boundary is the 
same on both sides. If this 
were not the case, i.e., if the 
field on one side were great- 
er than the field on the 
other side, it would be pos- 
sible to create a perpetual 
engine by moving charges 
arene the boundary against 
the field where the field is 5 Eat 
less and then allowing the Dn2=Dp NX Dia" €ef eg 
charge to move on the other Fig. 103 
side of the boundary (where 
the field is greater) under the action of the electric field forces. 
Therefore, the tangential components of the intensity on both sides 
of the boundary must be equal: 

By, = En. 

We shall use the Gauss-Ostrogradsky law to determine the normal 
components of the intensity at the boundary between two media. 
Construct an auxiliary surface in the form of an infinitely thin 
disk so that the parallel surfaces of the disk lie on opposite sides 
of the boundary. Since there is no charge inside such a disk, the 
net outward flux through the disk is equal to zero and, therefore, 
the flux through each end is the same. This requires that the normal 
components of the induction vectors be equal to each other, i.e., 
D,, = Dy,. Hence, in terms of field intensities, we obtain 


EE, => E2En,. 
Thus, the normal components of the intensity vectors are inversely 
proportional to their permittivities. 
17* 
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Fig. 103 shows that in passing from a medium of lower permittivi- 
ty to one of higher permittivity the flux lines are deflected away from 
the normal to the boundary. This means that the number of flux 
lines passing through a unit area increases. 

We are unable to determine the distortion produced in an electric 
field when a dielectric having a particular shape is introduced into 
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Fig. 104 


this field. This problem is difficult even when the field is uniform to 
begin with. If a body of arbitrary shape is placed in such a field, 
the field becomes nonuniform not only near the body, but inside 
the body as well. 

Interesting exceptions are ellipsoids, a broad class of bodies includ- 
ing spheres, flattened ellipsoids that practically do not differ from 
plates, and extended ellipsoids that are akin to cylindrical bodies. 
In mathematical physics it is shown that the field inside an ellipsoid 
is uniform as indicated in Fig. 104. Applying the law of flux refrac- 
tion, we obtain the typical fields illustrated for a denser body in 
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a less dense medium (€, < €2) as well as for the reverse case (€; > 2). 
Examples are a glass ellipsoid in air and an air bubble in glass, 
respectively. 

It can be shown that if a symmetrical dielectric body is immersed 
in a uniform field Z, in vacuum, then £; is related to the field inside 
the dielectric as follows: 


E; om E,—NP, 
where P is the polarisation vector and N is a coefficient depending 
only on the shape of the body. In the case of magnetic phenomena, 


it is customary to call the latter the demagnetisation coefficient 
(see p. 292). 
eE— 


Since in most cases P = Fz 
d 
sion after simple conversion: 


L,= 


d E;, we obtain the following expres- 


Fo 
N * 
Pea) Gs 


The dielectric constant is always greater than unity. Hence, the 
field intensity inside the dielectric is always less than the field inten- 
sity present at this location before the dielectric was introduced 
into the field. 

The coefficient NV for a flat plate perpendicular to the field is equal 
to 4x. This is the maximum value for N and the resulting decrease 


in field to + of its original value agrees with the result obtained ear- 


lier for a homogeneous medium. Let us take another extreme case — 

a cylinder whose axis is oriented parallel to the field. Here, VN = 0, 

i.e., the field does not decrease in-such a body. In all other cases, 

the decrease in the field intensity depends on the dielectric constant. 
4 

For a_ sphere, N=> and, therefore, #; = oes . For a cylinder 

whose axis is oriented perpendicular to the field, NV == : 


The field intensity E decreases because the bound charges create 
a field of opposite direction. 

As regards the induction field, the bound charges affect it only 
indirectly. Thus, the number of D-lines remains unchanged when 
a dielectric is immersed in the field. However, due to flux refraction, 
the induction inside the dielectric increases. 


CHAPTER XV 


MAGNETIC FIELDS 


97. Magnetic Moment 


Magnetic fields act on currents, moving charged bodies or parti- 
cles and magnetised bodies. A variety of instruments exist for deter- 
mining the properties of a magnetic field. The most convenient way 
of characterising the properties of such a field is to describe its 
mechanical action on a current circuit. It is quite feasible to con- 
struct a wire circuit of very small area, which enables us to measure 
the magnetic field quite accurately. Thus, a “test” current circuit 
plays the same role in magnetic field theory as a “test” charge does 
in electric field theory. 

Experiments with such a device lead us to the following basic 
conclusions. At each point of the field, a circuit that is free to rotate 
assumes a definite equilib- 
rium position. The position 
of stable equilibrium is de- 
scribed not only by the orien- 
tation of the ‘circuit axis in 
space, but by the orientation 
of a definite side of the circuit, 
e.g., the side for which the cur- 
rent will appear to be flowing 
counterclockwise as viewed by 
an observer on that side of the 
circuit. Let us call this side 
positive or north and agree to 
draw the normal to the cir- 

Fig. 105 cuit so as to form a right-hand 

screw system with the current 

direction. Thus, the normal emerges on the positive (or north) side 
of the circuit. 

If the behaviour of current circuits is compared with that of mag- 
netic needles, one observes that the normal to a circuit in stable 
equilibrium points in the same direction as a magnetic needle. 
Thus, the basic definition is not contradicted if we call the direc- 
tion of the normal to a free test circuit the direction of the magnet- 
ic field. 

A torque will act on a test circuit that’deviates from the equilib- 
rium position (Fig. 105). Moreover, the deviation of the circuit from 
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equilibrium is uniquely described by the deviation of the normal to 
the circuit from the direction of the field. It turns out that the sine 
of angle a and the torque VV are proportional to each other, i.e., 
N ~ sina. Furthermore, for a particular angle a, the torque is pro- 
portional to the product of the circuit area S and the current J flow- 
ing in the circuit. Decreasing the area by a certain factor results 
in the same change in torque as a decrease in current by the same 
factor. 

It follows from the above that the magnetic behaviour of a circuit 
depends on the orientation of the normal to the circuit and on the 
magnitude of the product JS. This can be expressed by means of 
a single vector quantity—the so-called magnetic moment of the ring 
current. In electrical engineering, the magnetic moment is generally 
designated by the vector Mf = JS, where 7 is the unit normal vec- 
tor. In the Gaussian system of units used in physics, a constant of 


proportionality - enters into this formula, i.e., M =+ Sn, where 


c is the velocity of propagation of electromagnetic waves in vacuum. 
The introduction of a numerical coefficient, which is moreover dimen- 
sional, may appear to be an unnecessary complication. However, 
other formulas are thereby simplified. This will become clear to 
the reader later on. 

Experiments with a test circuit show that VN = BM sina, where B 
is a constant of proportionality. The volume of B varies from field 
to field and for different points in space of a particular field. This 
formula shows that B is equal to the maximum torque acting on 
a unit test circuit (MM = 1). We call this coefficient B, which charac- 
-terises the magnetic field, the magnetic induction. The vector quantity 
whose direction is that of the magnetic field and which is numeri- 
cally equal to B is known as the magnetic induction vector. 

If the torque is described by a vector directed along the axis of 
rotation (in accordance with the right-hand screw rule), the formula 
for the torque may be written in vector form as follows N = [MB]. 

When NV = 0, I is parallel to B. This means that a current circuit 
tends to become so oriented in a magnetic field that the directions 
of the magnetic moment and the field coincide. The magnetic moment 
acting on a body is a maximum when it is perpendicular to the 
direction of the field. For a circuit, this means that the plane of 
the loop of wire is parallel to the flux lines. 

Having determined the magnetic field by means of a current cir- 
cuit whose magnetic moment has been calculated from measure- 
ments of the current and the area, we can then do the reverse, namely, 
use the formula VV = [ZB] to determine the magnetic moment of 
systems whose currents cannot be measured. Moreover, we can 
transfer the concept of magnetic moment to systems in which the 
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concept of a ring of electric current has no meaning. This is precisely 
the case when the physicist refers to the magnetic moment of an 
electron or nuclear particle. The magnetic moment of a magnetic 
needle is also a concept that cannot be broken down. However, 
after having discussed certain special effects of the medium, we shall 
return on p. 270 to the magnetic moment of a permanent magnet. In 
any case, the magnetic moment of a system located in vacuum can 
always be determined by the above formula for the torque. 

A body possessing a magnetic moment requires the expenditure 
of work to turn it from its equilibrium position. In the case of a body 
turned through a'small angle a, the work of rotation may be ex- 
pressed in the form 


Nda=BM sinada= —d(BM cosa). 


The deviation of a body from the equilibrium position is associat- 
ed with the accumulation of a “potential” energy U = — BM cosa. 
This product is the scalar product of two vectors. Hence, U = — BM. 

In the equilibrium position, the potential energy is a minimum 
and equal to —BM. When the magnetic moment is turned through 
an angle of 90° the potential energy increases to zero. Finally, when 
the magnetic moment is directed oppositely to the field (position of 
unstable equilibrium), the potential energy is a maximum and 
equal to + BM. 


Examples. 1. The magnetic moment of the nucleus of a hydrogen atom (nu- 
clear magneton) is 0.505 & 10-23 Gaussian unit. The magnetic moment of an 
electron (Bohr magneton) is 0.927 x 10720 Gaussian unit. 


2. An electric current of 1a flowing in a loop whose area is 50 cm? creates 
a magnetic moment of 5 x 1073 a metre? = 5 Gaussian units. 

3. Inthe Gaussian system of units magnetic induction is measured in gausses 
while in the practical system B is measured in volt sec/metre?; 4 volt: 
sec/metre? = 104 gausses. In the case of the magnetic field of the Earth, B = 
= 0.49 gauss. 


4. The magnetic induction in the air gap of a powerful clectric generator 
attains a value of several thousand gausses. Academician P. L. Kapitsa has 
obtained pulsed magnetic fields in which B ~ 105 gausses. 


98. Ampére Force 


The torque acting on a current-carrying circuit is clearly the 
resultant of the forces exerted on every part of the conductor in 
which current flows. We can experimentally establish the relation 
for the force acting on a current element. For this purpose, it is 
necessary to isolate a part of the circuit—e.g., by means of mercury 
contacts. This part is then able to move under the action of a force. 
Utilising the tension of a spring to counterbalance the displacement, 
one can measure the magnetic force (Fig. 106). 
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Ampere first established the relation for the force acting on a cur- 
rent element of small length. This relation has the following form: 


I ae I ee ao 
dl’ =— (dl, Bj, i.e, dF =— dl x Bsin dl, B. 


The vector notation here is suggestive of the familiar left-hand 
rule. The force acting on an element of wire length is always perpen- 
dicular to the plane passing through the current and the magnetic 


Fig. 106 


induction vector at this location. To determine the sense of the 
force, note from which side the rotation of vector dl toward vec- 
tor B, through the smallest angle, appears counterclockwise. This. 
is the positive side in a right-hand screw system and the force vector 
then points toward the observer. The force has a maximum value 
when the current element is perpendicular to the vector field. When 
the wire element is parallel to the flux lines, the force is equal to zero. 

The above formulas are in the form used in physics, i.e., valid 
for the Gaussian system of units. In the form used in electrical 


engineering, the coefficient ~ is absent- and the formula for the 
Ampere force is 
dF =TI{dl, B). 
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To determine the magnitude of the force acting on a piece of wire 
of finite length, one must integrate the above expression for the force: 
pa! \ [dl, B}. 


c 


In the simple case of a rectilinear piece of wire of length 1, locat- 
ed in a uniform magnetic field B, Ampére’s law may be directly 
applied in the form 


4 mer an 
F=— IIB sin lt, B. 


A perfectly natural relationship exists between Ampére’s law and 
the torque expression derived in the preceding article. We shall 
I consider only the simple 
case of a rectangular loop 
oriented parallel to the flux 
lines in a uniform magnetic 
field (Fig. 107). Two sides 
of the loop are perpendicu- 
lar and the other two sides 
are parallel to the flux lines. 
Therefore, the forces acting 
on the wire elements may 
Fig. 107 be reduced to the two shown 
in Fig. 107. These forces are 
equal and in accordance with Ampére’s law may be written in the form 
¥ = IIB. As can be seen from the figure, the Ampére forces create 
a torque N = JiBd. But since Id = S is the area of the loop, we 
obtain N = JSB = MB, which is the same as the formula for 
the torque derived in the preceding article. We leave it to the 
reader to derive a more general proof. a 


Example. The force acting on a conductor whose length iy 3 metres and 
through which a current of 50 a flows in a field of 3,000 gausses = 0.3 volt 
sec/metre? is F=BI1=0.3 X 50 X 3 = 45 newtons ~ 4.5 kg. In the case of 
a rotor diameter of ~1 metre, the torque acting on the loop is ~ 45 newton 
metres. These values are of the order of magnitude of the parameters of a large 
electric motor. In an electrical measuring instrument, a force F = 2 X 1075 new- 
ton = 2 dynes ~ 2 mg acts on a conductor of length 2 cm through which 
a current of 0.01 a flows in a field of 100 gausses. For a loop diameter of ~ 1 cm, 
a torque of ~2 X 10-7 newton metre acts on the loop. 


99. Force Acting on a Moving Charge 


We may go a step further and consider the magnetic forces acting 
on currents as forces applied to elementary particles of electricity. 
Electric current issimply a flow of electric charge. If e is the par- 
ticle charge, vw the particle velocity and 7 the particle concentration 
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{i.e., the number per unit volume), then the expression for the cur- 
rent intensity may be expressed in the form J = nevS. Thus, all the 
particles in a volume vS pass in 1 sec through the wire cross-sec- 
tion S, i.e., the quantity of electricity flowing is equal to nevS 
(Fig. 108). Substituting this expression in the formula for Ampére’s 
law, we obtain: 


dF =—[vB]nS dl. 


But nS dlis the number of particles in the conductor volume under 
consideration. Thus, the force acting on one particle is 


f =—[vB). 


This force is sometimes called the Lorentz force—in honour of Lo- 
rentz, the distinguished physicist who contributed so much toward 
the development of the theory of electrons. 

The above expression for the force (we shall restrict ourselves to 


the form used in physics, i.e., with the coefficient 4) immediately 


Fig. 108 


provides the answer to a number of very interesting questions regard- 
ing the nature of motion of electric particles (e.g., electrons and 
protons) in a magnetic field. The force acting on a moving particle 
is perpendicular to the flux lines and to the particle velocity vector. 
If a particle moves parallel to the flux lines, no force is exerted on 
‘it. On the other hand, the force is a maximum when the motion 
occurs in a plane that is perpendicular to the flux lines. In this case, 


we obtain f = = evB. 


If the field is uniform, an electric particle moving perpendicular 
to the field will describe a circle, for according to the fundamental 
law of mechanics this is the nature of the motion under the action 
of a constant force directed at right angles to the motion. We shall 
return to the problem of particle motion in a magnetic field later 
(p. 445). 
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Example. Electrons ina cathode-ray tube accelerated by a potential differ- 
ence of 70 volts acquire a velocity of 5 x 108 cm/sec. Upon entering a magnet- 
ic field of 500 gausses at right angles to the field, each electron experiences 


a Lorentz deflecting force of f = —evB = 4 X 107! dyne. Under the action 
Cc 


of this force, the’ electron begins to move in acircular orbit of radius R, where: 


2 
R is determined from the relation f = >: Hence, R = 5.6 cm. 
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Every permanent magnet has two poles.* The flux lines are direct- 
ed outwardly at the north pole and inwardly at the south pole. 
Imagine a surface constructed so that it encloses the north pole- 
We can then determine the total number of lines passing outwardly 
through this surface. By analogy with the corresponding electric 
quantity, this number is called the magnetic flux and is designated by 
the letter ®. The flux through an elemental area perpendicular 
to the flux lines is equal to dD = BdS, . Thus, through any arbitrary 
area, dD = BdS cos a, where a is the angle formed with the flux 
lines by the normal to the area; and through the surface S, D = 


=(B cos @ dS. Finally, through the closed surface, @ = 
= § Boos a dS. 


The flux My, directed outwardly at the north pole and inwardly 
at the south pole, is the fundamental characteristic of a magnet. 
The stronger the magnet, the greater Dy. This somewhat justifies 
the designation “quantity of magnetism”—which is only of histori- 
cal significance—for the quantity proportional to the flux, namely, 


1 P < ‘ 3 
te @. Sometimes m is called the magnetic mass, but this term 


is even less appropriate. In electrical engineering units, m = QO. 
If the poles of a magnet are small (e.g., in the case of a magnetic 

needle), the flux lines close to the poles are directed radially. 
Using the Gauss-Ostrogradsky theorem, 


§D cosa dS = 4nq, 


we derived the formula for the electric induction of an isolated charge, 
namely, D = +. Clearly, an “isolated” magnetic pole should 
yield a magnetic induction satisfying the analogous equation: 


B= = , since $B cosadS = Anim. 
; 


* The creation of magnets with any number of pairs.of poles is also col- 
ceivable. 
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In the case of the electrical engineering formulas: 


m. 7 : 
=——;, since § BoosadS =m. 
Amr? ‘ 


To be sure, “isolated” magnetic poles do not exist. The above for- 
mula has meaning only for long magnets having point poles, and 


Fig. 109 


then only close to the poles. Nevertheless, this method of 
dealing with the pole of a permanent magnet is fully justi- 
fied in practice. This can be clearly demonstrated by means of the 
expression for the field of a bar magnet 
Considered as a magnetic dipole whose two 
Poles m are separated by a distance 1. Fig. 109 
shows the field of a bar magnet and the ideal 
field based on the formula 


Bae m 19 
Fase ee gt 
re ry rz 72 


where a, and ro are the distances from the poles 
to the point under consideration. The fields 
are seen to be very similar. 

Calculations yield good results for the field 
at large distances from the magnet. Thus, if 
the distances r; and r2 are large relative to 
the magnet length J, the distance between 
the poles of the magnetic dipole, we are fully 
justified in considering the poles as points. Fig. 110 
The calculations are exactly the same as the 
corresponding calculations for electric interactions. Let us compare, 
for example, the values of the magnetic induction created by a bar 
magnet at a distant point along the magnet axis and at a distant 


Bz 
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point perpendicular to the axis. In the first case, we obtain 
B m m 2ml 2M 


re (rF+1)2 — 73 8 


where M = ml iscalled the magnetic moment of the permanent. 
magnet. In the second case (Fig. 110), 


m M 
B=25 CoS O= =F . 


Thus, the field along the axis is twice as large. 
In the electrical engineering system of units, the last two formulas 
assume, respectively, the following form: 


M M 
B= pasa and B= tat’ 


Example. Let us calculate the magnetic induction created by a bar magnet 
of length / = 10 cm at a distance r = 1 metre from the magnet, measured along 
the axis. The magnet cross-section is S = 3 cm? and the induction in the mag- 
net is 500 gausses. 

The magnetic flux in the magnet, which is the same as the outward flux 
from he pole, is © = 500 X 3 = 1,500 maxwells. Thus, a ‘‘magnetic mass’” 


pkey ad 120 Gaussian units is concentrated at the magnet, pole. The mag- 
netic moment of the magnet is 
M=ml= 120 x 10==1,200 ergs/gauss (Gaussian units). 
-Then, for the magnetic induction, we obtain 
2M 2x 1,200 


SS — SS 4 {073 
B 73 (1003 2.4 10-3 gauss 


101. Magnetic Field Intensity 


Let us consider the interaction of an isolated magnetic pole and 
a current element (Fig. 111). The magnetic pole creates a field B at 
the location of the electric current. Therefore, in accordance with 
Ampére’s law, the force acting on the current element is 


dF =+1 (dl, B). 


In place of the magnetic induction, we can substitute the expression 
for a point pole. Since the field is directed along the radius, we 
obtain the following expression for the interaction force: 


ar=—, 1 dl, =| 
or 


vat 
dpa dlxsindl, r. 
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It is quite natural to assume that the force with which a current. 
element acts on a magnetic pole is represented by the same formula, 
except that the direction of the force is reversed. This assumption 
cannot be directly verified experimentally since an isolated pole and 
an isolated element of constant current do not exist. However, we 
can verify the validity of the above statement by integrating the 
interaction forces in actual cases. Jt turns 
out that theory and experiment agree. 

Thus, the force exerted by a current ele- 
ment on a magnetic pole may be written in 
the form 

av=—,1{ dl, > 


cr2 
or, inthe electrical engineering system, i.e., 
F A 1 ; 
without the coefficient > and replacing m 


m e 
by zz: 


_ m oy eee er 
P= I [ az, |. Fig. 111 
A minus sign does not appear in this formula because a reversed 
radius vector has been assumed. The direction of 7 is always taken 
as the direction from the field source to the point of observation. 
Therefore, in the case of the force acting on the current, 7 was as- 
sumed to be directed from the pole to the current element. Now, when 
the force is exerted by the current on the pole, the radius vector 7 
is assumed to be directed from the current element to the pole. 
The force acting on a unit magnetic pole is called the magnetic 
field intensity: 
' aF 
dH =—. 
m 
Thus, our discussion has shown that the magnetic field intensity 
created by a current element is given by the formula: 


I , 
dH=—,| dl, =|. 


In the electrical engineering system of units, this formula for the 
magnetic field intensity created by,’a current has the form 


I Tr 
dH =; | dt, 2]. 

Thus, a magnetic field may be characterised in two different ways, 
namely, by the induction vector and the intensity vector. The for- 
mer measures the action of the magnetic field on a current and the 
latter measures the action of the field on a magnet. 
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In practice, it is easier to reduce the measurement of intensity to 
the measurement of the torque acting on a magnetic needle (Fig. 112). 
Such a needle located in a uniform field is subject to the action of 
a couple of forces, where the magnitude of the force is equal to mH 
and the arm of the couple is equal to J sin a. Hence, for the torque, 
we obtain the expression 


N=MUHsina 


In vectorial form IN = [B£H], where M = ml is the magnetic 
moment of the needle. It is seen that this formula is very similar to 
that for the torque acting on a cur- 
rent-carrying circuit. 

The relationship existing between 
the magnetic field intensity and 
the magnetic induction can be de- 
termined experimentally. It turns 
out that in all cases, except in the 
case of anisotropic bodies, the inten- 
sity and induction vectors are par- 
allel to each other. This means 
that the magnetic needle and the 
axis of the test circuit are always 
parallel. Moreover, in all cases, 
except in the case of ferromagnet- 
ic substances, a simple linear relationship exists between Hf and B, 
namely B = po, where po is a universal constant (the so-called 
magnetic permeability of vacuum) and w isacoefficient characterising 
the medium (the relative magnetic permeability of the medium). 

In the system of units used in physics, pp = 1. This yields the 
same dimensions for the magnetic induction and the intensity. 
The price that must he paid for this identity, however, is the intro- 


duction of the dimensional coefficient 4 in the formula for Ampére’s 


law. In the system of units used in electrical engineering, the mag- 
netic permeability of vacuum is 


Fig. 112 


Mo = 4 x 10-? joule/a? metre. 


102. Interactions of Currents and Magnets 


The relations considered in the preceding sections enable us, in 
principle to calculate the interactions of any magnetic system. 
We have at our disposal formulas for the forces and torques acting 
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on devices by a magnetic field of any origin: 


Action on a current 
Action on a magnet 


Physics formulas Elec. eng. formulas 
F=- [dt, BY F=I {dl, B) F=mH 
N=([MB], N= [MB], N =([(MA)j, 
where M=+ IS where AJ =IS where M=ml 


Formulas relating fields to their sources: 


Field duc to current Field due to magnet 
Physics formulas Elec. eng. formulas ee here 
ees d r ee: dl, 2 _m _ mm 
dH=4, [dt, +] dH =, |a, +] Bam B= a 
os _ 2M _ M_ 
B=pH B= top B= B= eer 


Substituting any of the lower formulas in any of the upper ones 
and using the relation B = wopH, we obtain the formulas for 
magnetic, electromagnetic, magnetoelectric and electrodynamic inter- 
action. We shall illustrate each type of interaction by an example. 

Magnetic interaction, i.e., the action of one magnet on another. 
Two poles separated by a distance r interact in accordance with 
Coulomb’s law, i.e., 


myme __ __myme 
~~ re ee 4rpigur2 * 
The interaction force is inversely proportional to the magnetic per- 
meability. 

Electromagnetic action, i.e., the action of a current on a magnet. 
A current element exerts a torque on a magnetic needle. For simplic- 
ity, we assume that M 1 H, i.e., the magnetic needle is perpendic- 
ular to the flux lines. Then, 


JN 
dN = disin dé, * or 
cr 
Via 
dN = ws dl sin dl, 7. 
Jur 


18—1409 \ 


274 Magnetic Fields 


The interaction does not depend on the magnetic permeability, 
i.e., on the properties of the medium. 
Magnetoelectric action, i.e., the action of a ‘magnet on a current. 
Consider a current-carrying circuit located along the extension of 
the bar magnet axis at a distance r 
from the magnet (Fig. 4113). The 
torque acting on the circuit is 


MeurMmag 
alee cae 


7 N=M wy /B sing = sin a 


7 or 


/ N =m sin a. 
2nr8 

a Thus, the interaction does not de- 
pend on the magnetic permeability. 


8B Example. A circuit of area S=20cm?, 
through which a current / = 10 a flows, 
Fig. 143 interacts at a distance of 100 cm with 
; a bar magnet whose magnetic moment 
is Mmag = 1,000 Gaussian units. The torque acting on the circuit is 
N= el 
r 


Meur =scm x 10 X 20X3 x 109 = 20 Gaussian units and 
N=4X 10-2 dyne cm. 


Electrodynamic action, i.e., the action of one current on another 
current. Two parallel currents are attracted with a force 


dF =41 di,B, where B= ponH = pou dlp, 
i.e., 


dF=p IIo dly dls 


cope or dF = tig tee ata ae 2 


Amr? 


The interaction is directly proportional to the magnetic permeability. 
The formulas for the interaction of magnetic systems may be 
derived in exactly the same manner. 


Example. It is essential to take into account electrodynamic interaction 
in the laying of bus bars. If a short circuit should occur, the bus bars and their 
supporting insulators must be sufficiently firm to withstand large electrodynam- 
ic forces. Assume that a gurrent I, = I, = 3 x 10‘ a flows in parallel bus 
bars separated by a distance d = 20 cm. A force F = BI = pofll acts on 
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I, : ‘ 
a unit length of each bus bar, where H= sq is the magnetic field intensity 


created by the linear current flowing in the other bus bar (see p. 279). 


Thus, 


> Mol? 4a x 10-7 X 9X 108 : 
ar AOD =900 newtons, 


i.e., a force of ~ 90 kg acts on each metre of busbar. The same result could 
be obtained by integrating the last formula for dF above. 


103. Equivalence of Currents and Magnets 


We have called attention to the fact that a similarity exists be- 
tween the expressions for the torque acting on a magnetic needle and 
the torque acting on a current-carrying circuit. As a matter of fact, 
these two systems behave extremely alike in an external field. If 
each of the systems is characterised by its magnetic moment vector, 
the likeness appears even greater. Both systems tend to become ori- 
ented in the magnetic field with the magnetic moments parallel to 
the flux lines of the field. If the magnetic moment is displaced from 
the position of stable equilibrium, a torque VY = [MH] acts on the 
system in the case of the magnetic needle and a torque N = [MB] 
in the case of a current-carrying circuit. The potential energies of 
these systems are represented by the formulas U = — MH and U = 
= —MB, respectively. 

If we recall that B = por, the difference between the two for- 
mulas becomes immediately evident, i.e., they differ only with 
respect to the magnetic permeability factor. Hence, as regards 
mechanical action, a magnetic needle of moment M is equivalent 


to a current-carrying circuit of moment M,= 


top 
However, the analogy between these two syatams goes even fur- 
ther. We shall now show that the proper fields of a magnetic needle 
and a current-carrying circuit are alike except for a constant factor. 
This similarity occurs at distances considerably greater than the 
dimensions of the system. We shall prove this for a point in space 
in line with the magnetic moment at a distance r from the centre-of 


the system. The field of a magnet for such a point has already been 
9 

calculated and found to be equal to B=4 . It remains to deter- 
mine the field of a circular loop of current along the axis of the loop. 

In Fig. 114 are shown the vectors of the field intensities due to two 
arc elements of current directed into and out of the page, respec- 
tively. The field intensity vectors are perpendicular to their corre- 
sponding current elements and radius vectors, i.e., they are in the 
plane of the page. The sense of the intensity vectors is determined. 


18* 
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from the vector product rule or, what amounts to the same, the 
right-hand screw rule. 

Since a current element and its radius vector are perpendicular to 
each other, the field created by such an element is equal in this case 


to dH = 72 | Now, let us add the two vectors depicted in the 


figure. For the field created 
by the “antipodes”, we obtain 


21 dl 
dH = —-cosB, 


where the meaning of the va- 
rious designations is evident 
from the figure. This value of 
field is yielded by every such 
pair of “antipodes”. Therefore, 
the resultant field is obtained 
from the last expression by 
replacing di, the length of an 
Fig. 114 element, by xa, half the circum- 
ference of a _ circle. Thus, 
the field intensity along the axis of a circular current, at a dis- 
tance r from the current,* is given by the formula 


2 
a ce ; 
cr 


: 1 é 7 2 
Since — 4S is the moment of {a circular current (here S = xa’), 


we obtain: Hat and B = pyro = 

Thus, we have proved that a magnetic dipole and a current-carry- 
ing circuit are not only equivalent as regards the forces acting on 
them, but also as regards the fields created by them. The nature of 
the equivalence is the same in both cases. A magnetic needle of 
moment M can be replaced by a current-carrying circuit of moment 


M 


Lio * 


In vacuum and for the system of units used in physics, i.e., for 
fo = 1, the equivalence principle becomes even simpler. In this 
case, a magnetic needle of moment M is equivalent to a current- 
carrying circuit having the same moment. 


ce 


* Since we are dealing with large distances, the difference between r and 
the distance to the centre of the system is negligible. 
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Examples. 1. Let us return to the example on p. 270 and calculate the mag- 
netic induction of the same magnet in the practical system of units: 
B=0.05 volt sec/metre2, S=3 x 10-4 metre?, 
@ =15 x 10-6 volt sec, m=15 x 10-5 volt sec, and 
£=0.1 metre, and Af =mi = 15x 10-6 x0.1=1.5 x 10-6 volt sec metre. 
Hence, 


=2.4x 10-7 volt sec/m?, 


Which is in complete agreement with the result obtained on p. 270. 
2. Consider a current-carrying circuit for which J = 5 a and S = 2 cm’. 
The magnetic field intensity created at a distance r= 50 cm (along the axis 


of the circuit and perpendicular to its plane) is H = = 
4 = 
M=— TS =3 7910 9X 3x 109x 2=1 erg/gauss 


and H —1.6 x 10-5 oersted. 


104. Rotational Nature of a Magnetic Field 


A study of the nature of magnetic lines shows that magnetic lines 
differ basically from electric fields. Electric lines have a beginning 
and an end, i.e., there are no closed lines in a constant electric field. 
On the other hand, experiments show that magnetic flux lines, i.e., 
vector lines of magnetic induction, are always closed. If other words, 
such lines have neither beginning nor end. 

For reasons discussed above, force fields in which the work along 
a closed path is equal to zero are known as potential fields. Vector 
fields characterised by closed flux lines are known as rotational 
fields. A magnetic field is a rotational field. 

If we describe a closed surface in a magnetic field, the net outward 


flux @ = B cosadS through such a surface will always be equal 
to zero. In other words, the number of lines entering this surface 
is equal to the number of lines leaving it. Thus, the equation § B 


cos adS = 0 is the mathematical expression of the fact that magnet- 
ic flux lines have neither beginning nor end. 

The magnetic lines always encircle the currents creating the field. 
Therefore, the integrals taken along induction or intensity flux 


lines, i. e., § Badl and § H dl, respectively, differ from zero. It is 


more convenient to consider the second integral, for its value is 
proportional to the magnitude of the electric current encircled by 
flux lines. This can be seen from the basic field intensity formula, 
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‘which shows that H and current strength are directly proportional 
to each other. 


By analogy with electrostatics, \ Hdl is called the magnetomotive 
force (mmf). If the integral is taken along a flux line, then 


\ #a=§ mat. 


The magnetomotive force along a closed curve is proportional to 
the current encircled by this curve: > 


4 H dl=—kl, 


where k is a coefficient of proportionality. 
A flux line may encircle more than a single current. Then, using 
the algebraic sum of the currents, the equation assumes the form 


VHdl=k DVI. 


Deeper theoretical analysis, which we are unable to go into here, 
shows that the above equation is subject to-two more generalisations. 
First, the integral need not be taken along a flux line, but can be 
taken along any arbitrary circuit. Secondly, the coefficient of 
proportionality in the equation is a constant depending only on 
the properties of the medium and is the same for all geometric 
conditions. Thus, the magnetomotive force is the same for any 
closed curve that encircles a current of specified strength. The shape 
of the curve and its length are of no significance. [t is immaterial 
whether the curve encircles one current or ten currents and whether 
these currents are rectilinear or curved. As long && the algebraic 
sum of the currents passing through the closed curve remains the 
same, so does the magnetomotive force. 

Since the coefficient of proportionality in the formula for the mag- 
netomotive force is a universal constant, we can determine / if the 
magnetomotive force can be calculated for any system whose field 
is known. 

We are familiar with the general expression for the magnetic field 
intensity of a current element, but mathematical difficulties are 
encountered in calculating the magnetomotive force by means 
of the formula for the field intensity, namely, 


dH =—,[al, =). : 


However, we are also familiar with the formula for the magnetic 
p e : . 2M 
field intensity along the axis of a circular current, namely, H=-z : 
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No special difficulties are encountered in calculating the magnetomo- 
tive force along this axis. We should not be disturbed by the fact that 
the integration is carried out along a straight line, while we are 
interested in the magnetomotive force along a closed curve. As 
a matter of fact, a straight line extending from minus infinity to 
plus infinity is a closed curve, for it is closed at infinity. The expres- 


sion for the magnetomotive force, ( H dl, along such a closed curve, 


i.e., along the axis of a circular current from minus infinity to 
plus infinity, may be written in the form 


a dl 

So (Vee ay? 
where a is the radius and | is the distance measured along the axis 
of the circuit. The integral is easily evaluated by using the new var- 
iable B, defined by the formula —_ cot B. It turns out to be equal 


to. Substituting + Ina? for M and equating the value of the 
magnetomotive force to AJ, we obtain 


k= 4t (in the Gaussian system of units) 
kk =1 (in the electrical engineering system of units). 
The magnetomotive force relation now assumes ‘the form 
40 NV = 
§ Hat=—= SVT or $H db = DiI. 


The magnetomotive force relation is very useful in determining 
the magnetic fields of various systems. Its application is facilitated 
by considerations of symmetry and in this respect the following 
discussion is quite analogous to the discussion relating to the solu- 
tion of the corresponding problems in electrostatics by means of 
the Gauss-Ostrogradsky theorem. 

Let us first consider an infinitely long rectilinear current. From 
considerations of symmetry it is evident that the flux lines must be 
circles whose centres lie on the wire axis. It is similarly evident that 
at all points on such a circle the numerical value of the intensity is 
the same. Applying the magnetomotive force relation to such a flux 


line, we obtain: H§d=2. Here, § dl is simply the length of 
a flux line. If the points under consideration are located at a distance 
r from the wire axis, then § dl = 2ur. Thus, for the magnetic field 


of an infinitely long rectilinear current in the region outside the 
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wire, we obtain 


Haz (in the Gaussian system of units) 


fel =a (in the electrical engineering system of units). 


Let us now determine the magnetic field intensity inside the wire. 
Assume that the radius of the wire is designated by a and that the 
current density is uniform over the wire cross-section. Hence, the 
flux lines within the wire must also be circular. Consider such a flux 


line of radius r. The current flowing through it will be, I . There- 
fore, the magnetomotive force rela- 


H tion yields: 
2 
Hx Qar=“2 7, ana 
c a 
2r 
| —— 
7 In the electrical engineering sys- 
Ly tem of units 
ae 
etl Sra 


Thus, we see that the magnetic field 
intensity along the wire axis is 
equal to zero. The intensity in- 
creases with radius and becomes a maximum at the surface of the 
wire. Then, for the region outside the wire, the field intensity de- 
creases, being inversely proportional to the distance from the axis 
(Fig. 115). 

If the field is determined at a point for which the distance r is much 
less than the distance to the end of a wire, then the formula H = Z 


Qnr 
is also valid for a wire of finite dimensions. 


Fig. 115 


Example. Let us calculate the magnetic field intensity at a distance of 5 cm 
from the axis of a rectilinear current of 20 a. 


In Gaussian units (I = 20 x 3 x 109 = 6 X 1010 statamps): 
aI ®+ 2x6 x 1010 . 
A a= — = OR 
cr Bsc A010 sc 5 0.8 oersted 
In practical units (J = 20 a and r = 0.05 metre): 
I ies 20 __ pa, amps 
2nr = 2nx0.05 ‘metre * 


Another important example of the application of the magneto- 
motive force relation is in the calculation of the field of a solenoid. 
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Consider a uniformly wound toroidal solenoid whose circumferen- 
tial length is Z. The field within the solenoid is uniform and all the 
flux lines are concentric with Z. This system plays the same role in 
magnetic field theory as a parallel-plate condenser of infinite extent 
in electric field theory. Each flux line envelops all n turns. Therefore, 
the magnetomotive force along a flux line of length Z is 


§ Hal = nl. 
Since § Hal = HL, we obtain 


= —— J (Gaussian system) 


H = 1 (electrical engineering system). 


The magnetic field intensity of the coil is determined by its “ampere- 
turns”, i.e., the product of the current strength and the number of 
turns per unit length of the solenoid. It should be noted that the sim- 
plicity of the last formula is one of the justifications for the electrical 
engineering system of expressing the field equations. Since a solenoid 
is one of the basic elements of electrical engineering devices, simpli- 
fication of the formula for the calculation of its magnetic field inten- 
sity is of great practical significance. 


The formula H = 1 is also valid for a straight solenoid if used 
to determine the field within the solenoid at points sufficiently far 
away from the edges. 

Example. The magnetic field intensity at the centre of a long, thin sole- 
noid, where L = 15 cm, n = 1,500 turns and J = 0.1 a, is 


amps 
metre ° 


H = 1,000 


In the Gaussian system: 


4n nm, 40 1,500 9\ — : 
H=— cA l=337q00 a5 (0.4 x3 x 10%) = 12.56 oersted; 
4 paid ere x 10-3 oersted and 1 oersted ~ 80 ba 
metre metre 


105. Law of Electromagnetic Induction and Lorentz Force 


The phenomenon of electromagnetic induction discovered by 
Faraday, the great English physicist, may be described as follows: 
An electric current is induced in a closed conductor loop if the value 
of the magnetic flux passing through the loop changes. Moreover, the 
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inducted emf is proportional to the rate of change of the magnetic 
flux, i.e., the derivative with respect to time 


ew , where (D = c Boosads. 
dt a 

We shall show that the law of electromagnetic induction is closely 
related to the existence of a Lorentz force. If the electromagnetic 
induction is due to the displace- 
ment of a wire inamagnetic field, then 
the law of induction follows from the 
expression for the Lorentz force. 

In order to avoid confusion due to 
difficulties of a purely mathematical 
nature, let us simplify the proof by 
assuming that the induced emf arises 
in a rectangular circuit oriented per- 
pendicular to the flux lines ina uni- 
form magnetic field. A change in flux 
produces atranslatory displacement of 
one side of the rectangle of length / 
as shown in Fig. 116. Since there are 
free charges in the displaced con- 


Fig. 4116 ductor, these charges are subjected 
to the action of a Lorentz force 
{= <vB when the conductor moves with a velocity v. In view 


of the ‘fact that the velocity, magnetic field and conductor are 
at right angles to each other, we are able to dispense with the vector 
notation in the formula for the force since the sine of the angle is 
equal to unity. The Lorentz force is directed perpendicular to the 
plane passing through the direction of the velocity v of the charges, 
and hence of the wire, and the direction of the magnetic flux lines. 
Thus, the force is directed along the wire. The charges are impelled 
to move and an induced current is thereby created. 

The electromotive force is equal to the work of moving a unit 
charge around a closed circuit. Since the force acting on a unit 


charge is equal to + vB, the work of this force along the moving 


wire is equal to = vBI. Moreover, no work is performed along the rest 


of the circuit. Hence, the last expression is the expression sought for 
the induced emf. It has the form: 


eit 4 vBl (in the Gaussian system of units) 


6'"4 =vBl (in the electrical engineering system). 
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Assume that the wire moves a distance dz during the time dt. 
Thus, the area of the circuit increases by the amount ldz = dS and 


the magnetic flux by the amount d® = BdS. Since v= dz the 


dt’ 
: : : : 1 BaS 
induced emf may also be written in the following form: rar Tae But 
this is precisely the expression for Faraday’s law of electromagnetic 
: A F : {1 d@. sy * . 
induction, i.e., 6°" = a n the Gaussian system of units and 


dt 

ind dM, ‘ : : 
eu = ——,- in the electrical engineering system. 

We have thus demonstrated that electromagnetic induction and 
the deflection of moving electric charges in an external field repre- 
sent different aspects of one and the same law of nature. In the 
next chapter, we shall return to this interesting problem. Here, 
it was only necessary to present the essence of the electromagnetic 
induction law. 


106. Measurement of Magnetic Fields by Means of 
Induced Impulses 


The phenom:znon of electromagnetic induction is utilised in the 
design of precision instruments for the measurement of magnetic 
fields. Assume that it is necessary to deter- 
mine the value of the magnetic field at. some 
point in space. A’ small, flat coil or a single 
turn of wire is placed in a magnetic field per- 
pendicular to the flux lines and the ends of 
the winding are connected by means of leads 
to the terminals of a ballistic galvanometer. 
Now, if the coil is rapidly turned through 
a 90° angle in such a manner that its flat 
surface becomes parallel to the flow lines, an 
electrically induced current flows in the wind- t 
ing as the coil is being turned. The brief Fic. 417 
flow of current, which rapidly reaches a max- ie 

’ pidly ! 
imum and then drops to zero, is called an 
induced impulse (Fig. 117). During this brief interval, a certain 
quantity of electricity flows in the wire. The charge can be very 
accurately measured by means of a ballistic galvanometer, a de- 
vice having a moving coil with a high moment of inertia that 
integrates the electric current over the period of the impulse. 

If the resistance of the coil is R and the number of turns n, the 
induced current strength may be written in the form 

n d@ 


“ R dt 


f 
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The quantity of electricity flowing in the wire during the period of 
the induced impulse is 
t 2 
Q= \ I dt=R-n \ d® = Rn (D,—,), 
0 1 


where @, is the value of the flux passing through the coil in the 
first position and @®, is the value in the second position. 

If ®,; or M2 is equal to zero, i.e., magnetic flux lines do not pass 
through the coil in the initial or final position, the performed meas- 
urement yields the value of the magnetic induction. For this purpose, 
we need only divide the value of the magnetic flux by the coil area S, 


ie., B= =. 


n 

Of course, other methods of measurement are also possible. Thus, 
instead of rotating the coil, the field may be switched on or off. Also, 
to make the effect more pronounced, the coil can be turned through 
an angle of 180° instead of 90°. This doubles the effect. Similarly 
the polarity of the field can be reversed, rather than simply switched 
on or off. 

Since the measuring coil may be made as small as a square milli- 
metre, measurements by this method enable us to accurately deter- 
mine the magnetic field in small regions. 

This method is also used to measure magnetomotive force. For 
this purpose, we use a measuring device known as a Rogovsky belt— 
a long coil on a flexible belt. The belt may be shaped into any desired 
form and its two ends placed at any two points in a given region. 
Also, if desired, the ends of the belt may be brought into contact 
with each other. We shall show that when the field is switched off 
the deflection of a ballistic galvanometer connected to such a meas- 
uring belt is proportional to the magnetomotive force along the path 
of the flexible belt. 

The deflection of the ballistic galvanometer is a measure of the 
quantity of magnetic flux passing through all the turns of the coil. 
Let n be the winding density, i.e., the number ofturns per unit length 
of the measuring belt. Then, on a small belt segment Al;, there are 
nAl; turns, and the magnetic flux passing through these nAl; turns 
is equal to M;nAl;. 

If the medium is homogeneous and all the turns have the same 
area, then 

DO, = pSHi, 
and the total magnetic flux passing through the entire measuring 
belt is 7 
O= >i pSnH,Al; 
s ’ 
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Taking the limit for Al; > 0, we obtain 


2 
D=pSn | Hdl. 
i 


Since the measurement occurs in a medium for which p does not 
significantly differ from to, the quantity pSn is a constant of the 
instrument. The throw of the ballistic galvanometer in these belt 
measurements is exactly proportional to the magnetomotive force 
between the points at which the ends of the belt are located. 

By means of this device, it is easy to demonstrate the validity of 
the laws discussed in Sec. 104. Thus, as long as the coil encircles 
one and the same current, the magnetomotive force will remain the 
same for all configurations. Also, it is easily verified that the mag- 
netomotive force along a circuit not encircling current is equal to 
zero. For the case when the coil encircles a current several times, the 
magnetomotive force can be shown to increase by the corresponding 
number of times, etc. 

It should be emphasised that magnetic field measurements by 
means of induced impulses are of particular importance when we are 
concerned with the magnetic field inside a solid body. The only other 
method available is to make a slit in the solid body. Such a proce- 
dure is usually not possible. 

Let us consider the most common problem—the measurement of 
the magnetic permeability of an iron body. The most accurate results 
are obtained when the substance under investigation is in the form 
of a toroid. Two windings are wound on such a ring—one connected 
to a current source and the other to a ballistic galvanometer. If 
current is flowing, a magnetic flux © = BS passes through the 
ring. By reversing the direction of the current through the primary 
winding, an induced current is produced in the secondary. The 
quantity of electricity Q flowing through the galvanometer is related 
to the magnetic induction inside the ring by a relation already 
discussed, namely: 


p= & 


NoS ui 


where S is the cross-section of the toroid (assuming the turns are 
wound right on the surface of the ring), 72. is the number of secondary 
turns and fF is the resistance of the secondary winding. As regards 
the magnetic field intensity, it may be determined by the formula 
ats . The magnetic permeability of 
the substance under investigation is then equal to B divided by H. 


for a ring solenoid, namely, H = 


286 Magnetic Fields 


107. Finite Bodies in a Magnetic Field 


To one degree or another all bodies possess magnetic properties. 
These are indicated, first, by the fact that a magnetic field exerts 
forces and torques on bodies and, secondly, by the fact that a body 
placed in a magnetic field distorts the field. As was indicated above, 
the magnetic properties of a substance are characterised by the coel!- 
ficient 1—the magnetic permeability of the substance. In accordance 
with the value of 1, bodies may be divided into three distinct classes: 
ferromagnetic substances—including iron, nickel and cobalt—whose 
relative permeabilities are much greater than unity; paramagnetic 
substances, whose permeabilities are somewhat greater than unity; 
and diamagnetic substances, whose permeabilities are slightly less 
than unity. Typical values are given in the table. 
a NN ee gh ee Be 


Substance ww x 
Copper .............. 0.999990 — 10-5 
Wate in. ee, a aes Se 0.999991 —9x 10-6 
Platinum ............, 4.000300 300 x 10-8 
Silicon: 4-2. dave ie Be 0.999986 —14x 10-6 
Tungsten sos. 4-0 ais oe oe a) oe 1.000079 79x 10-6 


When a diamagnetic or paramagnetic body is placed ina magnetic 
field, the distortion of the field is negligible. On the other hand, when 
a ferromagnetic body is placed in the field, there is considerable 
distortion. 

The forces exerted by magnetic fields may be detected without 
particular difficulty in the case of paramagnetic and diamagnetic 
bodies. In the case of iron objects, everybody is familiar with the 
fact that magnetic fields exert large forces. 

Let us first consider magnetic forces. A body that does not possess 
magnetic properties becomes magnetic when placed in a field. This 
magnetisation process manifests itself in the acquisition of a mag- 
netic moment by the body. As we know, a system possessing a magnet- 
ic moment may he detected’in two ways. In a uniform field, the 
body tends to become oriented in such a manner that the direction 
of the moment is parallel to the external field. Moreover, in a nonu- 
niform field the body will experience a force tending to move it 
along the lines of force. 

In the case of ferromagnetic bodies, the torque may be detected 
without difficulty. The magnetic moment of the body may then be 
determined from the formula N = [MH]. However, we are not 
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usually interested in a body having a particular shape, but are 
interested rather in the substance as such. Therefore, when possible 
the measured value is recalculated on the basis of unit volume. The 
vector directed along the magnetic moment and numerically equal to 
the magnitude of the magnetic moment per unit volume is called the 
magnetisation vector J. Of course, the magnetisation vector can be 
determined without difficulty from the magnetic moment of a body 
only if we are sure that the magnetisation of the sample is uniform. 
This is the case when the sample is in the form of an ellipsoid or 
a degenerate ellipsoid, i.e., a cylinder, plate or sphere (see p. 261). 
That is why such bodies are used in experiments of this kind. 

Determination of the magnetisation vector by measuring the torque 
is easily accomplished for ferromagnetic bodies. Since the torque 
is very small for paramagnetic and diamagnetic bodies, this meas- 
urement is difficult to perform. It is therefore preferable in these 
cases to measure the forces acting on a body located in a nonuni- 
form field. 

Let us consider a small volume of a magnetic substance located in 


a nonuniform field. For simplicity, assume that the field varies 
along one axis and that the gradient is equal to sae Since a small 
volume of magnetic substance behaves like a magnetic dipole, the 
potential energy of a unit volume may be written in the form U = 
= —JH. If the moment acts along the field, the force exerted on 
a unit volume of the magnetic substance is equal to the derivative 


of the potential energy with respect to the coordinate, i.e., 
dH 
[a gee 


Thus, if we know the field gradient, we can determine the magnetic 
moment of a unit volume of the body under investigation by meas- 
uring the force. In practice, there are various ways of accomplishing 
this, the simplest being by means of a so-called magnetic balance. 
A thread is passed through an aperture made in one of the pans of an 
analytical microbalance. Then, the sample is attached to the end of 
the thread and suspended between the poles of a magnet. Before and 
after energising the magnet, the sample is balanced; hence, the 
difference between the readings is equal to the value of the force f. 

The weights must be quite accurate as can be seen from the fol- 
lowing example. A piece of bismuth, a substance whose diamagnetic 
properties are most pronounced, has a magnetisation J of 2 x 10-? 
Gaussian unit when placed in a magnetic field whose intensity H 
is ~1,000 oersteds. If the nonuniformity of the magnetic field is 
at 50 oersteds/em, a force of only 41cm will be exerted on each 
cubic centimetre of bismuth, i.e., f~ 1 dyne/cm?. 
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Experiments show that for diamagnetic and paramagnetic bodies 
the following simple relationship exists between the magnetisation 
vector and the magnetic field intensity: 


J =p «HH, 


where x is the magnetic susceptibility. For diamagnetic bodies x 
is negative, while for paramagnetic bodies it is positive. Values of x 


= ¥ s 
@) Diamagnetic body 6) Paramagnetic body 


Fig. 118 


are given in the table on p. 286. When x is positive the magnetisation 
vector is in the direction of the field intensity vector, but when x is 
negative, i.e., for diamagnetic bodies, the magnétisation vector 
is opposite to the field. 

Due to this difference in sign, the behaviours of these two classes 
of bodies under identical conditions are completely different. This 
is illustrated in Fig. 118. As can be seen, the differences are quite 
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striking. A paramagnetic body is attracted toward the region of 
strong-field, while a diamagnetic body is repelled from such a region. 
In a uniform field, a paramagnetic needle tends to become oriented 
with its axis along the flux lines, while a diamagnetic needle tends 
to become oriented perpendicular to the flux lines (see the analo- 
gous example in the case of a dielectric, p. 256). 

The determination of magnetic susceptibility by measuring the 
force in a nonuniform field may be accomplished for solid bodies in 
the form of a monocrystal or powder. This method is also easily 
adapted to liquids. In this case, the experiment can be so arranged 
that the measured quantity is the increase or decrease in the level 
of the liquid as it is attracted toward, or repelled from, the region 
between the poles of a magnet. 


108. Relationship Between Permeability 
and Susceptibility 


Both permeability and susceptibility may be determined straight- 


forwardly. The permeability is determined from the formula » = 5 


by measuring the induction and field intensity. The ‘susceptibility 
is determined, as described in the preceding article, from the forces 
exerted on a magnetic substance. To be sure, the relationship between 
these two characteristics of the magnetic properties of a substance 
can be established experimentally. However, there is no need to do 
this since an exact and simple relationship exists between p and x. 
This will now be shown. 

Let us return to the experiment for determining the magnetic 
permeability of a body in the form of a toroid. The primary winding 


creates a field of intensity H = ue , which is independent of the sub- 


stance of the toroid, i.e., if the toroid were not present, the field 
intensity would be given by the same formula. The situation is 
different as regards the magnetic flux. It can be shown experimental- 
ly that the value of B depends on the magnetic permeability. If 
an iron core is placed in the coil, B becomes hundreds or thousands 
of times greater than in the case of an air core. This increase in mag- 
netic flux is due to the magnetisation effect. 

First, it should be noted that the magnetic induction of a ring 
solenoid without an iron core (tof) has the significance of magnet- 
ic moment per unit volume. 

The magnetic moment of one turn is equal to JS, for in this discus- 
sion we shall employ the practical system of units. For the total 
magnetic moment of the system we obtain n/JS, and the magnetic 


moment in unit volume, ate , is simply equal to the field intensity. 


i9—1409 
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The magnetic moment of the equivalent dipoles is po times greater 
(cf. Sec. 103). Therefore, the magnetic induction poH of a uniform 
magnetic field created by the turns of a ring solenoid without an 
iron core can be expressed as the magnetic moment of the equiva- 
lent dipoles per unit volume. 

We are entirely justified in assuming that the significance of mag- 
netic induction is maintained if, without disturbing the uniformity 
of the field, the coil is filled with an additional number of magnetic 
dipoles. If J is the magnetic moment per unit volume due to the 


additional dipoles, the magnetic induction increases by this amount 
and becomes 


B=wH+J. 
Such an increase in B also occurs when the solenoid is filled with 
a magnetic substance. Since J=pyxH, then B=wy (x + 1)H; hence, 
the susceptibility and permeability are related by the equation 
p=1i+x 


Analogous calculations employing the Gaussian system of units 
lead to formulas with other coefficients. The magnetic moment of 
currents (and dipoles) per unit volume is 


RATS 
c 
LS 


aa 
An 


H. 
Therefore, in the presence of a medium, 
1 1 : 
@mebaZit+, 1.€., B=H+4a/. 
Letting J = x’H, we obtain 
B=(14 40x’) H. 
Hence, 


=1+4nn’ pa 5 
wp=1-+4nx’, where x iz 

Example. Let us perform the calculation in the example on p. 287 employing 
the practical system of units. For bismuth, %’ = 2 X 1075, i.e.,4 = 4x’ = 
= 8x & 1075. The piece of bismuth is in a magnetic field whose intensity is 


amps 10° amps 
metre 4% metre’ 


I = 1,000 oersteds = x 10° x 1,000 


Furthermore, the nonuniformity is expressed by 


aly = 50 oersteds =50x a x 103 x 100 
cm any 


dx 


amps — 5xX108 amps 
metre2 ~ 4m metre2 * 
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‘< volt sec 


The bismuth magnetisation is given by J =,t9xH =8n xX 107 metre?” 


Hence, 


the force acting on a unit volume (1 metre’) is 


_ 7 aH _ 4yn-6v, 2X 108 — 4, newtons 
ytons dyne ‘ 
Clearly, 10 ar 1, which agrees with the result obtained in 


the previous example. 


109. Distortion of a Magnetic Field Due to the Presence 
of a Magnetic Substance 


The problem of magnetic field distortion has practical significance 
only when the distortion is due to an iron body. To a large extent, 


: Fig. 149 


we shall be repeating the analysis given on p. 259 for the analogous 
case of dielectric bodies. 

At the boundary of two media of different magnetic permeability, 
the magnetic field vectors (induction as well as intensity) are refract- 
ed. To determine the law of refraction, let us first consider the 
magnetomotive force along a small circuit ABCD whose sides paral- 
lel to the boundary surface are close to each other, but on either 
side of the boundary as shown in Fig. 149. Since no current flows 
through this circuit, the magnetomotive force is equal to zero. 
Let us resolve the magnetic field intensity vectors on either side 
of the boundary into normal and tangential components. From the 
figure it is evident that the magnetomotive force can be equal to 
zero only if the tangential components are equal to each other: 


Ay, = Ap. 
19* 
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Another condition at the boundary between two media is estab- 
lished by considering the magnetic flux passing through a small cylin- 
der (not shown in the figure) enclosing a portion of the’ boundary. 
Since the magnetic lines have no sources, the number of flux lines 
entering the cylinder through the top is equal to the number leaving 
through the base. The lateral surface has an infinitely small area and 
the flux through it is equal to zero. Now, let us resolve the magnetic 
induction vectors on either side of the boundary into normal and tan- 
gential components. Clearly, the flux entering the cylinder can be 
equal to the flux leaving the cylinder only if the normal compo- 
nents of the induction 
vectors do not change in 
crossing the boundary: 
Bin = Bon. 

From these two rules, 
we can determine the 
law of refraction for flux 
lines. It is evident from 
the figure that 


tana py 
tana ta * 
igs nee In passing from airinto 


iron, the magnetic flux 
lines are deflected from the normal to a considerable extent and as 
a result the flux density sharply increases. For this reason, an iron 
body, whose permeability is hundreds or thousands of times greater 
than py “absorbs”, flux lines. This is the basis of magnetic shield- 
ing. Magnetic flux cannot, in effect, penetrate into a region 
bounded by iron since practically all of the magnetic lines enter 
the-iron (Fig. 120). 

The distortion of a magnetic field due to a magnetic body of 
specified shape is determined in exactly the same manner as in 
the case of a dielectric. If the iron body has the form of an ellipsoid, 
cylinder or plate, theoretical calculations show that the field inside 
the body will be uniform if the field was uniform before the iron 
was introduced. A relationship completely analogous to that given 
in Sec. 96 exists between Hy, the external uniform field before the - 
body is introduced, and H;, the field inside the iron after the body 
is introduced. The field intensity inside the iron body becomes less 
than the original intensity by an amount proportional to the magnet- 


isation: 


Hy= Ho J. 
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In order for the demagnetisation factor to be dimensionless, it is 
necessary to divide the magnetisation by the magnetic permeability 
of vacuum. Continuing with the practical system of units and sub- 
stituting 


J = [to (w—1) Hi, 
we obtain the following relationship between the external and inter- 
nal fields: 
pee HEM ss 
[-—h Nv’ 
In the Gaussian system of units 
H,;=H,—WNJ, 


A eng 9 


40 


A,= 


and the relationship between the external and internal fields is 
flven by 
H 
H; = ——*- ‘ 
1+ (u—1) Z> 
An 

The demagnetisation coefficient has the same value as in the case 

of dielectrics: 


w= (wv =4) 


for a.sphere, N = 4x (N’ = 1) for a plate, etc. 


110. Magnetic Hysteresis 


In discussing the permeability of iron, we may have created the 
alse impression that the magnetic properties of ferromagnetic sub- 
stances differ from those of paramagnetic substances only as regards 
the magnitude of the permeability. This is by no means the case. 
“erromagnetic bodies differ from the others mainly in that the mag- 
netic state of such a body is not linearly dependent, and moreover 
is not uniquely dependent, on the magnetic field intensity. Therefore, 
the concept of permeability for ferromagnetic substances is very rel- 
ative. The magnetic properties of iron are best illustrated by a mag- 
netisation vs. field intensity curve or a magnetic induction vs. field 
intensity curve. These curves are very similar to each other. 

Let us consider the magnetisation of an iron body as a function of 
the field intensity. At first, the magnetisation increases slowly, then 
rapidly, and finally magnetic saturation sets in. Such magnetisation 
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curves were first used by A. G. Stoletov and are characteristic of all 
ferromagnetic bodies (Fig. 121). We reiterate: the magnetisation 
and magnetic induction curves are very similar. The slope of the 
magnetisation curve gives the magnetic susceptibility, while the 
slope of the induction curve gives the magnetic permeability. From 
the figure, it is seen that the per- 
meability (also susceptibility) 
J] te curve has a maximum. For weak 
es fields the permeability is low. As 
.the field intensity is increased, p 
increases to a maximum, then be- 
gins to drop, and after reaching 
saturation remains unchanged. 
When the value of the permeabil- 
ity is given without specifying 
the external conditions, the maxi- 
mum permeability is usually 
Q 20 40 60 80 100 120 140 H,a/em meant. 
Fe TUS TE But there is more to be said 
about the behaviour of ferro- 


250 

200 magnetic substances. Let us assume 

‘ that after the iron has been 
brought to a state of magnetic satu- 

100 


ration the magnetic field intensity 
is decreased. It turns out that 
the induction decreases along a 
20 40 60 80 100 120 40 Hafm different curve, i.e., a curve lying 
higher than the initial magnet- 
isation curve. The field intensity 
may. be decreased to zero, but 
magnetisation remains. The corresponding values of magnetisation and 
induction at this point are referred to as the residual values. To remove 
the residual magnetisation, the polarity of the field must be changed. 
Thus, in the case of the experiment discussed on p. 285, it would 
be necessary to reverse the polarity of the current in the primary 
winding. Demagnetisation occurs when the field intensity attains 
a value H,, the value of the so-called coercive force. As the current 
is increased further in the same direction, the body begins to become 
magnetised in the reverse direction, i.e., what was previously a south 
pole becomes a north pole. The magnetic flux increases until 
the magnitude of the saturation is the same as in the initial 
process. ° 

Having attained a negative induction maximum, we may then 
proceed with the process in the other direction. In this manner, the 
hysteresis loop shown in Fig. 122 is obtained. 


Fig. 124 
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It is seen from the figure that knowledge of the intensity of the 
field in which the iron is located is not sufficient to determine the mag- 
netic induction and, hence, the magnetic permeability. Thus, for 
example, it is seen that three values of induction are possible for 
H = 400 oersteds—one occurs during initial magnetisation, the 
second during demagnetisation and the third when the magnetisa- 
tion process is repeated, i.e., just before the hysteresis loop is closed. 
The value of the magnetic induction, and the magnetic permeabil- 
ity, depends’ on the previous “history” of the sample. Hence, the 
esignation “hysteresis loop”. 
A hysteresis loop is usually drawn on the assumption that the fer- 
romagnetic body is brought to magnetic saturation. However, we can 


~/400-1200-1000 -800 -600 +f4I0-200 0 400 600 800 1000 1200 \Hoersted 
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fig. 122 


Clearly obtain numerous hysteresis loops having smaller dimensions 
y inscribing them, as it were, in the fundamental loop. For this 

Purpose, it is necessary to begin the demagnetisation process before 
reaching saturation. Then, to each H value there corresponds an 
Infinitely large number of values. ; 

A procedure based on this fact is used to bring a ferromagnetic 

ody to a state in which both induction and field intensity are equal 

to zero. This “zero point” is achieved by a series of successive magnet- 
ic reversals, whereby each succeeding cycle is begun at a lower 
intensity level than the previous one. 

The magnetic state of iron cannot be characterised only by the 
value of the permeability, field intensity or induction. Two of these 
quantities must be known—e.g., the induction and the intensity. 
The magnetic state of the iron is then represented by a point inside 
the fundamental hysteresis loop. 

The nature of a hysteresis loop depends to a great extent on the 
material. A body is said to be magnetically soft if the coercive force 
(and hence the loop area) is small. Typical soft materials are pure 
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iron, silicon steel, and iron and nickel alloys (particularly permal- 
loy—78% nickel). Carbon and other steels belong to the magnetically 
hard materials and are used in the manufacture of permanent may- 
nets. 

Experiments show that the temperature of a ferromagnetic sub- 
stance rises when it is subjected to magnetic reversals. This is very 
important in electrical engineering, for when iron is placed in a var- 
iable magnetic field the point on the B = f (H) curve representing 
the magnetic state of the iron is continuously tracing a hysteresis 
loop. Every. time a loop is traced a certain amount of heat is released, 
which according to magnetic field theory is related to the loop area; 
of course, the lower the value of the induction maximum, the smaller 
the loop area. Therefore, empirical formulas may be sought relating 
heat released and maximum induction. In electrical engineering. 
for example, the following formula is widely used: 


Q = nBrices 
where 7 is a coefficient whose value is given in tables. 


Example. For a good transformer steel, y = 0.00114. When Bmax = 
= 10,000 gausses, the losses are equal to Q = nBi Ss = 2.5 X 10° ergs/cem® = 


max 
= 2.5 X 1074 joule/em’. This means that for magnetic reversals in iron due 
to an alternating current whose frequency v is 50 cps the power loss is equal 
to 12.5 x 107-3 watt per cubic centimetre of iron. 


CHAPTER XVI 


ELECTROMAGNETIC FIELDS. MAXWELL’S EQUATIONS 


111. Generalisation of the Law of Electromagnetic Induction 


In the preceding chapter it was shown that the motion of a conduc- 
tor in a magnetic field is accompanied by induction phenomena. 
If this moving conductor is part of a circuit through which the mag- 
netic flux changes when the conductor moves, a current correspond- 


ing to the induced emf ¢ = es flows in the circuit. This current 
is due to the action of a Lorentz force: a force equal to + [vB] acts on. 


a unit electric charge. 
The induced current depends only on the relative displacement. 


of the conductor with respect to the magnetic field. Thus, it may be 
asserted with equal validity that a Lorentz force is produced when: 
a charge moves in a magnetic field or when the charge is “at rest” 
and the magnetic field moves. This follows from the principle of 
relativity. 

Consider a system of coordinates relative to which a magnetic 
field moves. Such a coordinate system may be fixed, for example, 
relative to a laboratory bench along which the pole of a permanent 
magnet moves. Then, a Lorentz force will act on charges at rest 
relative to this bench. Let us assume nothing is known about the 
moving permanent magnet. Having established that a force acts on 
the stationary electric charges, we are perfectly justified in conclud- 
ing that an electric field exists in this system whose intensity is, 
equal to the Lorentz force divided by the magnitude of the charge. 
Thus the electric field intensity in the “stationary” coordinate system, 
relative to which the source of constant magnetic field moves with 


velocity v, is expressed by the formula 
E=+ [vB). 


Of course, the relations differ for the electric field created by 
charges and the electric field created by the motion of the system rela- 
tive to the magnetic field. To begin with, this new field that we are 
considering has no charge sources. This means that the flux lines 
have neither beginning nor end. Moreover, it is not difficult to see 
that the flux lines of this electric field form closed curves, i.e., the 
electric field created by the moving magnetic field is rotational. 
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Imagine an arbitrary circuit that is stationary relative to the lab- 
oratory bench. The moving magnetic field crosses this circuit. If 
this imaginary circuit is replaced by a real wire circuit, then in 
accordance with Faraday’s law an emf is induced in the circuit thal 


is equal to Eat. Hence, the integral U = h Edt is not egual to 


zero. This means that the electric field # = 4 [vB] created by the 


moving magnetic field is a rotational field. 

For a real wire circuit UV = at ; Where @ is the magnetic flux 
passing through the circuit. However, it is immaterial whether or 
not wire is present at the location of the closed curve. The equation 
U= a2 is also valid for an imaginary circuit in the region where 
the sources of the magnetic field are moving. 

One final generalisation remains to be made. Experiments show 
that the cause for the change in the magnetic field is of no importance 
in the induction effect. The field change due to the motion of a per- 
manent magnet and that due to a change of current strength in a sta- 
tionary coil can always be made equal by, for example, bringing the 
permanent magnet closer or increasing the current in the coil creat- 
ing the field. Therefore, the law under consideration must be valid 
in all cases, no matter how the magnetic field is changed. Thus, if 
the magnetic field (magnetic flux) changes in a certain region in 
space, a rotational electric field is produced that is related to the 
magnetic field change as indicated by the following law. The elec- 


tromotive force U = G Edt along a closed curve is equal to the 


derivative with respect to time of the magnetic flux passing through 
this circuit: 

4 do 

c dt 


or, in the practical system of units, 
dD 


dé 


This is the generalised law of induction, one of the most important 
laws of nature. 

Let us examine the mathematical expression for this law. Equating 
the expressions for the electromotive force and the magnetic flux, 
the law can be written as follows in the Gaussian system of units: 


§ Edt = ee Beosads, 


ec 
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and in the practical system of units 


§ Edl = = \ BoosadS 

First, as regards the minus sign that appears in this equation, it 
should be noted that in vector analysis the direction in which the 
circuit is traversed and the direction of- 
the normal to the plane of the circuit are 
related to each other as follows: the posi- 
live direction of the normal in a right- 
handed screw system is such that as viewed 
from the vector terminus the circuit ap- 
pears to be traversed in the counterclock- 
wise direction (Fig. 123). Let us construct 
a closed curve in space and ascribe an 
arbitrary direction to it. The direction of 
the normal to the area encompassed by 
the curve under consideration is thus 
determined. Magnetic flux passes through 
this circuit. At a given instant, it may Fig. 123 
be positive or negative, depending on 
whether the induction vector forms an acute or obtuse angle with 
the normal. The derivative of the flux with respect to time is positive 


n 


) Positive 


Positive dtrection 
adtrection 


® decreases @ increases 
Fig. 124 


if the flux is increasing and is negative if the flux is decreasing. Thus, 
taking the minus sign into account in the induction formula, the 
law may be stated as follows: The electromotive force is positive 
if positive flux is decreasing or negative flux is increasing, i.e., the 
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direction of the electric lines of force coincides with the adopted 
direction for positive. On the other hand, the electromotive force 
is negative if positive flux is increasing or negative flux is decreasing. 
These relationships are clearly illustrated in Fig. 124. 

We shall show that the minus sign in the induction formula is 
the mathematical expression of Lenz's law. Assume, for example, 
that the north pole of a bar 
magnet approaches a coil and 
that the positive direction in 
the circuit is as indicated in 
Fig. 125. Then, the magnetic 
flux is positive and so is its 
derivative with respect to 
time. The electromotive force 
must be negative and the 
induced current is opposite to 
the direction adopted as positive. We can immediately find the 
magnetic field of the induced current by recalling that the flux lines 
emerge from the side of the current ring from which the current 
appears to be moving in a counterclockwise direction. Therefore, as 
the magnet approaches the circuit, a current of such direction is 
induced in the latter that the field produced tends to oppose the 
action which caused it. This is a statement of Lenz’s law. It is not 
difficult to verify this important rule for other particular cases as 
well. 

Let us sum up. A varying magnetic field is inseparable from an 
electric field. Moreover, it is seen that the division of fields into 
electric and magnetic is a relative matter. From one viewpoint there 
is only a magnetic field in space. From another viewpoint in addi- 
tion to the magnetic field there is an electric field. 

A rotational electric field consists of electric lines of force that 
link the magnetic induction vectors when the magnetic flux passing 
through a closed line of force changes with time. When the flux 
increases, the direction of the line of force is clockwise as viewed 
from the induction vector terminus. 


Fig. 125 


112. Displacement Current 


Electromagnetic field theory, whose foundation was laid by Fara- 
day, was mathematically perfected by the English scientist James 
Clerk Maxwell. One of Maxwell's most important new ideas was 
that symmetry must exist in the interdependence between magnet- 
ic and electric fields. ; 

In the preceding article, we discussed the problem of creating an 
electric field by varying magnetic flux. The question naturally 
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arises: Does a variable flux of electric lines of force create its own 
magnetic field? Maxwell answered this question in the affirmative 
and advanced the hypothesis that a relationship exists between 
variable electric flux and a magnetic field that is quite analogous to 
the generalised law of induction. According to the hypothesis, if 
a change in electric flux occurs in some region of space, a rotational 
magnetic field is created. Moreover, the magnetomotive force U 
taken along a closed curve is equal to the change in electric flux 
passing through this closed curve, i.e., 


dN 
G=—p: 

where 
U= {Hal 


and the electric flux 


N=\ Deosads. 
s 


In the Gaussian system of units 


The parallel in the relationships between magnetic and electric 
fields does not extend to the sign before the derivative of the flux. 
As is known, when currents are present the magnetomotive force 


along a closed curve is equal to U =/ (or 7 in the Gaussian system 


of units). How should the equation for magnetomotive force be writ- 
ten for such a closed curve that encloses electric current and variable 
flux of electric lines of force? Maxwell assumed that the magnetomo- 
tive forces are additive. Thus, the general formula has the form 


§ Hdl =14 50 


or in the Gaussian system 
4 dN 


§ Hdl == (1+-4-) - 


The expression aM has the dimensionality of electric current strength. 


Maxwell called it displacement current. He thereby incorporated 
in this designation the very widespread notion at the end of the 
nineteenth century that the field in a vacuum displaces the particles 
of an “ether” from their positions of equilibrium. .This designation 
has continued to prevail in science, although now the presence 
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of a field in vaccum is not related to the concept of particle displace- 
ment of any medium whatsoever. In a dielectric medium, the 


displacement current x may be resolved into two components, cor- 


responding to the intensity and polarisation vectors into which the 
displacement vector D can be resolved (see p. 255). Therefore, the 

portion of the displacement current “flow- 
rs ing” in the dielectric is determined by 
\ \ the change in the polarisation vector, i.e., 
by the relative displacements of the centres 
| ! of gravity of the positive and negative 
\ J charges. 

Before discussing the role of displacement 
current in one or another process, we shall 
prove an important proposition concerning 
the sum of the conduction and displacement 
currents. 

Consider an arbitrary system of. electric 
currents and imagine a closed surface 
drawn in such a manner that the currents intersect it. If the cur- 
rents are constant it follows directly from the law of conservation 
of electricity that the sum of the currents entering the closed sur- 
face must be equal to the sum of the emerging currents, or, to be 
more concise, the algebraic sum of the currents flowing through a 
closed surface is equal to zero. It is evident that this law may not 
be obeyed by variable currents: for example, in the case of a closed 
surface enveloping one plate of a condenser connected in an alter- 
nating current circuit (Fig. 126) or a closed surface through which 
the top of an antenna protrudes at one point. 

However, this theorem is valid for variable currents as well if 
the term “current” is taken to mean “total current”, i.e., the conduc- 
tion current plus the displacement current, rather than the conduc- 
tion current alone. To prove this, it suffices to consider an arbitrary 
curve on which a surface is based and for which the following relation 
is valid: 


Fig. 126 


§ Hal =I+Tacpr. 


By gradually reducing the closed curve to zero, the surface S on 
which this circuit is based becomes closed as shown in Fig. 127. 
(This process is similar to constricting the opening of a draw-string 
pouch.) The magnetomotive force reduces to zero and hence the sum 
of the conduction and displacement currents passing through the 
closed surface also equals zero. 

Now let us discuss the role of displacement currents in electro- 
magnetic phenomena. 
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It can be shown that displacement currents are negligibly small 
where the conduction currents are different from zero. Hence, the 
displacement current within a conductor is always disregarded. 

In calculating the value of the displacement current in dielectrics, 
two cases should be considered—displacement currents in a dielec- 
tric surrounded by a conductor forming aclosed circuit and displace- 
ment currents that are a continuation of a conductor of an open 
circuit. 

Consider a conductor forming a closed circuit in which electric 
current is flowing and which is intersected by a closed surface. If 


H 


Fig. 127 


the current is constant, then at each instant the same amount of 
electricity passes outward through the surface as inward. The situa- 
tion is different in the case of variable currents. Here, the strength 
of the variable current may have different values in different parts of 
the circuit (see below, p. 322). As a result, the strengths of the cur- 
rents passing inward and outward through the surface at a given 
instant may not be equal. Then, a displacement current “flows” 
via the dielectric from the point where the current is less to the 
point where the current is greater, and in this manner the current 
deficit is compensated for. Clearly, changes in the displacement 
current with respect to time will exactly correspond to changes in 
the conduction current. This phenomenon is significant only when 
the frequency of the current is sufficiently high. 

If the conduction current does not form a closed circuit, e.g., in 
the case of an alternating current circuit containing a condenser, 
the conduction and displacement currents are simply equal to each 
other. In this case, it may be said that the conduction current cir- 
cuit is closed by the displacement current. 

In spite of the fact that the magnitudes of the displacement cur- 
rents are quite large for such cases, certain calculations may be 
performed without considering them. Thus, when the conduction cur- 
rent circuit is closed by the displacement current between the con- 
denser plates, the magnetic field created in this region by the displace- 
ment current is the same as the field that would have been produced 
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if the conduction current flowed in an uninterrupted circuit. There- 
fore, the presence of displacement current does not affect the calcu- 
lation of the magnetic field, the coefficient of self-induction of a 
system, etc. 


113. Nature of an Electromagnetic Field 


The following equations, which were discussed in the two preced- 
ing articles, are called Mazwell’s equations: 


ae d@ dN 
{ Edl= — > and § Hdl =I+—_. 


aoe equations concisely sum up our knowledge of electromagnetic 
fields. ; 

Maxwell’s equations cannot be derived. The discussion of the pre- 
ceding two articles does not constitute a derivation, but constitutes 
rather an illustration of conjectures leading Maxwell to his discov- 
ery. 

A large class of phenomena of interest to physicists, electrical engi- 
neers and radio engineers obey Maxwell’s equations. The laws of 
these phenomena are a consequence of Maxwell’s equations and may 
be derived from them. The extraordinary importance of the predic- 
tions based on Maxwell's equations gives these equations equal rank 
with Newton’s laws of motion and the principles of thermodynamics 
as fundamental laws of nature. 

We shall not go into the mathematical methods of solving Max- 
well’s equations. It turns out that the above integral equations may 
be transformed into differential equations. Then, by solving Max- 
well’s differential equations, it is possible in principle to determine 
the electromagnetic field for a given distribution of charge and cur- 
rent. 

Let us again consider the physical essence of electromagnetic 
phenomena as given by Maxwell’s equations. It may be summa- 
rised in the following manner. 

The division of an electromagnetic field into electric and magnetic 
fields has only relative meaning. If from the viewpoint of an inertial 
system of coordinates only a magnetic field exists, then from the 
viewpoint of another system moving relative to this system there 
exists an electric field in addition to a magnetic field. The converse 
is also true, namely, if an observer in one system of coordinates 
finds only an electric field present, then an observer in another 
inertial system will find that both an electric field and magnetic 
field exist. 

Let us now consider an electromagnetic field from the viewpoint 
of an inertial frame of reference, first directing our attention to a 
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region of space in which free electric charges and hence conduction 
currents are absent. In this case, Maxwell’s epuations have the form 


. dN ae d@ 

) reel %, i — ee eed 

) Hdl=— and 4 Edl “7 

Both the magnetic and electric fields have a purely rotational charac- 
ter, i.e., the lines of force are closed and mutually interwoven: 
electric lines encircle magnetic lines and magnetic lines encircle 


aS 


electric lines. The electromagnetic field may be depicted as a chain 
of rings, whereby closed magnetic lines of force are alternately linked 
with closed electric lines of force (Fig. 128). Such a chain exists only 
if the field is variable: A ring of increasing magnetic flux creates 
about itself a ring of electric flux; the varying electric field creates 
a ring of magnetic flux, etc. 

If the region of space under consideration contains charges and 
currents, then in addition to rotational fields with linked lines of 
force there exists a rotational magnetic field whose closed flux lines 
encircle currents and a potential electric field whose flux lines begin 
on positive charges and terminate on negative charges. 


T_T. 


~ 
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CHAPTER XVII 


ENERGY TRANSFORMATIONS 
IN ELECTROMAGNETIC FIELDS 


114. Transformations in Steady Current Circuits 


Let us consider a portion of a conductor through which a steady 
electric current is flowing. If the resistance of the conductor segment 
is R and the potential difference across it is U, the current strength 


is determined by Ohm’s law, namely, J = a . The electric field 


performs work in moving charges along the circuit and the work 
per unit charge is equal to U. Since the current strength is, by defi- 
nition, the quantity of electric charge flowing through the conductor 
cross-section per unit time, the product JU yields the work per- 
formed by the field in moving the electric charge per unit time. This 
product, JU, represents power. If the current is a steady one, this 
work is completely converted into heat (thermal energy). Thus, 
the formula for calculating the thermal effect of current is 


pei EBS 

_ The transformation of the work performed by the electric field 
into heat occurs at each point of the conductor. To express this mathe- 
matically, Ohm's law must be converted into a form applicable to 
a point of a conductor, rather than to a portion of a conductor. By 


introducing the current density j, which is equal to 7 , where S 


is the cross-section of the conductor, and by replacing the expres- 
sion for the potential difference by El, and, finally, by expressing 
the resistance in terms of the conductor length J and its cross-section, 


iec., R= 7, we obtain: j = AE. 


Thus, it may be said that the current density is directly propor- 
tional to the electric field intensity. The specific conductivity A 
is the coefficient of proportionality and the direction of the current 
is assumed to coincide at each point with the direction of the inten- 
sity. The formula 


j=\E 


is called the differential form of Ohm’s law and should be viewed as 
an empirical law generalising the laws for current flow in conduc- 
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tors. Ohm’s law in its usual (integral) form is a consequence of this 
equation. 

Consider an infinitely small volume element of the conductor, 
dt, in the form of a cylinder whose generatrix dl is parallel to the 
flux lines and whose base dS is perpendicular to the current. The 
amount of electric charge flowing through a cross-section of the 
cylinder is jdS and the potential difference between the ends of the 
element is Edl. Hence, the work performed by the field in moving 
the electric charge through this volume is equal to jEdt. This for- 
mula also gives the heat released inside the volume dt. If we are 
interested in the work of the current in a large volume of the con- 
ductor, the last expression must be integrated. The formula 


gives us the expression for the work of the current or the thermal 
energy released per unit volume of the conductor. 

Thus, in the case of a portion of a direct-current circuit, the energy 
transformations are reduced to the transformation into heat of the 
work done by a field. However, the picture changes as regards the 
energy balance of the entire ¢losed direct-current circuit. The work 
performed by the electric forces along a closed curve when the field 
is constant is equal to zero, for the work performed by the electric 
forces in moving charge along the external portion of the circuit is 
equal and opposite to the work required to move the charge along 
the internal portion of the circuit. Therefore, the release of thermal 
energy in a direct-current circuit occurs only at the expense of the 
energy supplied by the current source—accumulator, electric gener- 
ator, elc.—i.e., at the expense of energy of nonelectrical origin or, 
as it is sometimes said, at the expense of the energy of an “applied” 
force. The role of electric current is reduced simply to the “transfer” 
of energy from the current source to the point where ‘the heat is 
released. The energy that a source is able to supply is given by the 
electromotive force ¢, which by definition is measured by the work 
performed in moving a unit charge along a closed curve. Actually, 
the applied emf performs this work only over those small portions 
of the circuit where the charge must be moved against the forces of 
an electric field. 

The power of the direct-current circuit is given by the expression 
I. This may be expressed in terms of unit volume if it is assumed 
that the applied forces are distributed throughout the volume. Then, 
the work performed by the applied forces is given by 


jE, 
20* 
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where Err! is the “intensity” of the applied forces. 

Designating the work of the applied forces by ?, and the thermal 
energy released by Q, the essence of the electrical transformations 
in a direct-current circuit may be expressed by the concise formula 


P-~Q=0. 


_ Example. For an isolated copper wire of cross-section S = 4 mm?, the per- 
missible current density in the case of an open wire is 7 = 9 % 10% amps/metre?. 
A 1-metre length of such a wire has a resistance of 4.25 x 1073 ohm. For the 
indicated value of j, a current~of J = 36 a flows in the wire and the thermal 
energy losses per second for this portion of the circuit amount lo /22 = 
= 1,296 X 4.25 « 1073 ~ 6 joules, i.c., in a unit volume 0.33 cal are re- 
leased each second. 


115. Transformations in a Closed Circuit of Variable Current 


A flow of variable current is inevitably accompanied by induction 
effects. Thus, to a variable current strength there corresponds a vari- 
able magnetic flux ©. Here, @ represents the number of lines of 
force which are created by the current circuit and which pass through 
the conducting circuit. In this case, the induction effects are due to 
the current’s own magnetic flux, whence the designation self-induc- 
tion. Since @ is continuously changing, an induced emf gs 


d@ : : . ; ‘ 
ara exists in the current circuit at each instant, in addition 


to the applied emf. 

The magnetic flux is always proportional to the current to the 
first power. Hence, the formula ® = L/ has universal validity. The 
coefficient Z is called the inductance of the circuit or the coefficient 
of self-induction. The value of L depends on the geometric properties 
of the circuit and the nature and distribution of magnetic bodies in 
the system. It does not depend on the conditions under which the 
system of conductors and magnetic bodies operate. Thus, for sel!- 
induced emf the following equation holds: 


pind ad] 

6 a —L-- . 

The significance of the minus sign in this formula may be explained 
as follows: When the current increases, the induced emf opposes the 
applied force, i.e., the induction is in the direction opposite to the 
applied force. On the other hand, when the current decreases, the 
directions of the induced emf and the applied emf coincide. This 
is the reason for the analogy generally made between mechanical 
inertia and self-induction. Self-induction impedes an increase as 
well as a decrease in current.. 
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Ohm’s law, relating emf and current strength, remains valid. 
Therefore, the product of current strength and total circuit resistance 
will at each instant be given by the following relation: 


IR= gr"! 4 Gnd a. garpl _ 7 a : 
t 


Multiplying both members of the equation by the instantaneous 
current strength, we obtain the energy equation: 
PR= 1g" — or 


Here, /gerr! = P is the work of the applied forces and [7R = Q 
is the thermal energy. It is seen that in a variable current circuit 
these two quantities are not equal to each other. The difference 


P — Q is equal at each instant to ug , i.e., it is equal to the deri- 


vative of = LI*. In other words, the excess of the work of the applied 
forces over the thermal energy released goes to increase the magni- 
tude of ae On the other hand, the excess of the heat released over 
the work of the applied forces occurs at the expense of the magnitude 
of + LP. The equation 


pond (Lr) 


is the expression for the law of conservation of energy. 

Clearly, the quantity W = + LJ* represents energy. It is the mag- 
netic energy of a system that is inseparably linked with the existence 
of a magnetic field in it. (In the Gaussian system of units the expres- 


: , ~ 11 : 2 . 
sion for magnetic energy is ay.) There is magnetic energy in a 


direct-current circuit too, but in this case it does not manifest itself 
since it remains unchanged. The induction effects occur only when 
the current is switched on and off. When the circuit is closed the 
applied forces perform work, which is expended not only in the 
release of heat, but also in the storage of magnetic energy. On the 
other hand, when the circuit is opened the thermal energy released 
is at the expense of the magnetic energy of the current. 

The magnetic energy formula may be verified experimentally by 
closing or, even better, opening a current circuit. The thermal energy 
released after the source is disconnected is numerically equal to 
the magnetic energy of the current. If the coefficient of self-induction 
is large, the release of heat continues over a period of time sufficient- 
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ly long to enable us to measure the heat by, for example, calorimet- 
ric means. 
Inductance may be measured in various ways and in the simplest 


cases may be calculated from the formula L = ° . The problem is 


reduced to the calculation of the magnetic flux passing through the 
system. 

An expression for the inductance of a ring solenoid will be required 
below. The magnetic flux ‘through one turn of a coil is D = 
= wow S, where S is the area of the turn, and the flux through 7 
turns is © = npopHS. Substituting the expression for the field 
intensity (using the practical system of units), we obtain 


T 
DO =nyows — : 


Now, dividing both members of this equation by the current strength, 
we obtain an expression for the inductance of a coil (also approxi- 
mately valid for a straight solenoid): 


2 
L=pop — S. 


The inductance of a coil is directly proportional to the magnetic 
permeability of the medium and increases sharply with the number 
of turns. An increase in inductance is achieved by using iron or by 
increasing the number of turns. In order to make clear the relation- 
ship existing between the coefficient of self-induction and the dimen- 


sions of the coil, let us multiply the numerator and the denomina- 
tor by l. Then, 


bow (3)°¥ 


and it is evident that the inductance is directly proportional to the 


volume occupied by the magnetic field and to the turn “density” 
squared. 


Example. Consider a long solenoid of small cross-section (! = 15 cm, n= 
= 1,500 turns, S = 1 cm? and J = 0.1 a). At the centre of the solenoid, the 


magnetic flux is D = nuowhHS = 6x x 10-5 volt sec. The inductance of this 
solenoid is 


(1,500)2 
0.45 


In practical units the inductance is measured in henrys (1 henry = 1 ohm 
sec). The inductance of coils in radio engineering is measured in millionths 
and thousandths of a henry. Chokes having iron cores can attain inductance 
values of the order of a number of henrys. 


n2 
L= jlo >-S= 40x 10-7 x 1 x x 10-4= 1.9 x 10-3 henry. 
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116. Magnetic Energy of a Field 


In the chapter devoted to electric fields, it was shown that the 
electric energy of a system may be viewed as a quantity whose den- 


sity distribution is represented by + eB? (in the practical system of 


units). The electric energy of the system is then determined by inte- 
grating this expression over the region occupied by the field. The 
importance of this circumstance was emphasised as it enables us to 
express the energy in terms of the field intensity and it confirms our 
conception of a field as something that can be localised. 
Naturally, we expect the situation to be similar for magnetic 
fields and this is indeed the case. It can be mathematically shown 


that the transition from the magnetic energy formula Abr, to 
oS fy ’ 2 


the expression for the magnetic energy density, + pow ° is complete- 


ly analogous to the corresponding transition from electric fields. 

Let us consider this transition for the simple case of the uniform 
field of a ring solenoid. Substituting the expression for the inductance 
in the magnetic energy formula, we obtain 


Hobe (+) “72 


5 V. 


Wu= 


But uf is the field intensity. Hence, the magnetic energy of a coil 
may be written in the form 


\H2 
Wu bake ae V~, 


so that the magnetic energy density is given by the expression 


— Hot? 
Wy = 5) . 
In the Gaussian system of units 
4 
Wy oo Ba pif?, 


Thus, for any system of currents, the magnetic energy may be 
represented by the integral over the volume occupied by the field: 


Wu=“2 \ pf? dt (in the practical system of units) 
and 


Wu=e \ wif? dt (in the Gaussian system of units). 
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Now, consider the magnetic energy of two currents. The expres- 
sion for this energy naturally divides into three integrals if the 
intensity of the resultant field HW is viewed as the sum of the field 
intensities of the two currents: H = HM, + MH». In the following 
expression for the magnetic energy, the significance of each of the 
integrals is quite evident: 


w= { wt dt + wo \ pH, H, dv + ( wll? dt. 


The first and third integrals yield the magnetic energy of the first 
and second currents, respectively, while the second integral repre- 
sents the interaction energy of the two currents. This last integral 
may assume different values even though the magniludes of the 
field intensities, H; and Hz, do not change. Thus, if the mutual 
disposition of the two-currents changes, the field vectors H, and Hz 
are turned, generally speaking, relative to each other and the value 
of the interaction energy also changes. 

Of course, the first and third integrals may be expressed in terms 


of the current strength and the inductance: ea oF and ay 31 fd As 


for the second integral, it is clear that its value is proportional to 
the product of the current strengths. Thus, 


{ WH Hy dt = MIyIo. 


The coefficient of proportionality AY is known as the coefficient of 
mutual induction. Just as in the case of inductance, AZ depends on 
the geometry of the system and the distribution of magnetic bodies. 

Thus, it is evident that the change in the magnetic energy of a 
system of currents is related not only to the work of the applied 
forces and the thermal energy released, but also to the work per- 
formed by the field in moving the conductors under the action of an 
Ampere force. Hence, the law of conservation of energy requires 
that the following equation be satisfied: 


dWy = —A—(Q—P) dt, 


where A is the mechanical work. Thus, it may be stated that, in the 
general case, the magnetic energy expended is equal to the work of 
moving the conductors and to the excess of released thermal energy 
over the work of the applied forces. 

The relations introduced in this article do not take into account 
one phenomenon—magnetic hysteresis. This problem will not be 
dealt with because of its specialised nature. 
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Example. The energy stored in the magnetic field of the coil described in 
the example on p. 310 is 


2 49. 10-3 4)2 
WwW re tt O05 x 10-5 joule. 


a 


The energy density is 
wu Hot = 4x x 10-7X 1x COO" = 0.63 joule/metre’. 


Naturally, the same result could be obtained by dividing the total energy of 
the magnetic field by the volume of the coil: 


Why 
MST 


117. Electrie Oscillations 


The processes of transforming electric energy into magnetic ener- 
gy and vice versa are of fundamental importance in electrodynamics. 


A simple system in which such transformations occur is a charged 
electric condenser whose plates are 


connected at a certain instant to the 
ends of a coil (Fig. 129). When the 
condenser discharges, -an electric cur- 
rent flows through the coil and creates 
a magnetic field around it. At each 


Cc 
instant, the electric field of the condens- : 
er and the magnetic field of the coil 
are closely linked. The energy of this Fig. 129 
system at each instant is equal to the 


energy of the electric field, which is concentrated mainly between 
the condenser plates, and the energy of the magnetic field, which 
is concentrated mainly inside the coil. As is well known, electric 
oscillations arise in such a circuit and we shall now show that such 
oscillations are inevitable. : 

To begin with, let us disregard thermal energy losses. Then, the 


law of conservation of energy requires that the following equation 
be satisfied: 


W=+ _ +5 ! LJ? =const. 


The sum of the electric energy and the magnetic energy is the same 
at each instant. Hence, the derivative of the above expression with 
respect to time is equal to zero: 


dy =2 pid =0 
dt dt . 
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Since the current strength is equal to the decrease in charge on the 
condenser plates, i.e., 


the equation may be simplified as follows: 
Q dl. 
etLla-=0. 


Such a relationship between the charge on the plates of a condenser 
and the current strength, which is equal to the derivative of the 
charge with respect to time, can be satisfied only if harmonic oscilla- 
tion of the charge and the current is assumed. 

This becomes evident upon comparing the above relations with 
the equations for mechanical vibrations (see p. 87): 


dQ dx 
far aE 
dI 1 dv 
a C Q, a kz 


Charge and current, on the one hand, are analogous to displacement 
from equilibrium and velocity of motion, on the other. As for the 
parameters of the system—inductance is analogous to mass and 
reciprocal capacitance is analogous to the rigidity of the system. 

Let the initial time equal the instant when the condenser is fully 
charged, and assume that 


Q= Qo Cos we, 

Then, 
I= — Qow sin wt. 
Substituting in the differential equation, we obtain 
— LQow* cos wt = a Qo cos wt 
or, after cancelling, 
ene 
Vic’ 


Thus, irrespective of the initial charge on the condenser plates, the 
harmonic oscillations occurring in the condenser have a natural fre- 


quency @) = Tare The smaller the capacitance and inductance 


of the circuit, the higher the frequency of the electric oscillations. 
What is the situation in a real circuit where the thermal energy 
losses cannot be neglected? Clearly, the total energy of the system 
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in this case will decrease in accordance with the equation 
; 1 , dQ dl 
, ny £ 2RP— 
dw =—FPRdt, i. e., —PR=7Q—-+LI =. 


Differentiating again with respect to time and using the relationship 
between charge and current, we obtain an equation of the form 


d2J dl 1 
Lay tRa+qei=0. 


At this point, an analogy should be drawn between the corresponding 
electric ‘and mechanical quantities. Comparing the last equation 
with the equation for mechanical vibrations with friction (p. 94), 
it is seen that the electric resistance is analogous to the coefficient a, 
which is a measure of the mechanical resistance. 

The solutions of such linear differential equations are considered 
in courses in advanced mathematics. We shall simply give the final 
result, which incidentally is easily verified by substitution in the 
above equation: 


I= Iye—®' cos wt. 
The frequency of oscillation is given by 
o=Var—p. 


Thus, the process is determined by two characteristics—the natu- 


ral frequency of free undamped oscillations, @) = Vib and the 


damping coefficient, B = s. It is seen, first, that light damping is 


achieved by decreasing the resistance relative to the inductance. (To 
be sure, this is not easily done, for if we increase the number of turns 
on the coil, both quantities increase simultaneausly. However, L 
does increase at a faster rate.) Secondly, it will be noted that when 


wr <P, i. e., 4L<CR?, 


oscillations become impossible. The discharge of the condenser 
under such conditions leads to an aperiodic process analogous to 
the return swing of a pendulum displaced in a viscous medium from 
its equilibrium position. 


Example. Assume that we have a variable condenser whose maximum capac- 
ilance C = 900 pf. Calculate the corresponding inductances of the radio coils 
for the 1,500 metre and 15 metre wavelengths. 

1. The frequency of electric oscillations corresponding to A; = 1,500 metres 
is vy = 2 X 10° cps = 200 ke/s. Since 


= 204, = then Ly =—,~= 1.2 x 10-3 henry=1.2 millihenry. 
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In order for the process in the circuit to be periodic, the resistance of the 
circuit must be less than 


R,=2 V/ 54 =3,000 ohms. 


2. Ag=15 metres, ve= 2107 cps= 20 me/s, Ly=0.12x10-@ henry =0.12 
microhenry. In order for oscillations to be possible, the resistance of the 


circuit must be less than R,=2 y 2- 30 ohms. 
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In a system in which the oscillatory circuit consists of a condenser 
and a coil (particularly if the condenser is composed of large plates 
separated by a short distance and the coil has a large number of 
turns), the electric and magnetic fields are concentrated in their 
respective regions. Therefore, it is possible to consider the electric 
and magnetic energies as two related, but nevertheless distinct 
quantities. This division loses physical significance to a large extent 
when we consider rapidly varying fields, where large electric and 
magnetic ficlds exist in the same regions. 

Recalling what was said in Sec. 113 about the relative nature of 
the division of an electromagnetic field into electric and magnetic 
components, it should be understandable that it is necessary to 
introduce into the theory the concept of an electromagnetic energy 
that is formally equal to the sum of the electric and magnetic energy 
of the field. The density of electromagnetic energy in space is 


= = (cL? + pH), 
while the electromagnetic energy contained in the volume V is 


/ 1 e a 
W == \ (eZ? + pH) dV. 
Vv 


In rapidly varying fields, the physical significance of the transfor- 
mation of magnetic energy into electric energy, and vice versa, is 
lost. At the same time, any energy transformations occurring in an 
electromagnetic field must be taken into account in the energy balance 
by a single electromagnetic energy quantity. 

If the above expression for electromagnetic energy is assumed to he 
valid, then, using the electromagnetic field equations of the preced- 
ing chapter, the following theorem for the decrease in electromagnet- 
ic energy within a certain volume of space can be rigorously proved: 


dW 
—< =(P—Q) + $K cosads. 
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This theorem was proved in 1884 by Poynting. (It was proved in 
a more general form, i.e., not in connection wilh an electromagnetic 
field, by N. A. Umov in 1874.) The integral on the right is the flux 
of the veclor A.* Using calculations that we have been forced to 
omit due to their complexity, it can be shown that this vector is 
perpendicular to the plane passing 

through the field vectors £ and 4 E 

a 130), and is equal to. A = 


LE 7H] in the Gaussian system 


of wile and to WT = (4H) in the 
practical system of units. 

Since Lhe values of the field inten- 
sities decrease quite rapidly with 
increasing distance from the field 
sources, the flux of the Poynting 
veclor is reduced to zero when all 
of space is taken into account. In 
this case, the theorem states: The 
change in electromagnetic energy is 
equal to the excess of the work of the Fig. 130 
applied forces over the released heat. 

However, of most interest is the application of the theorem to a 
finite volume, i.e., when the flux of the Poynting vector is not equal 
to zero. If the volume under consideration does not include currents, 
the equation assumes the form 


_— ay _ pK cosa ds. 


The change in electromagnetic energy is equal to the flux of the 
Poynting vector through the surface bounding the volume under 
consideration. 

The Poynting vector characterises the flux of the electromagnetic 
energy and the last equation expresses the following fundamental 
concept: A change in electromagnetic energy within some volume is 
accompanied by an outflow or inflow of an equivalent amount of 
energy. 

In essence, Poynting’s theorem is a direct consequence of the law 
of conservation of energy and the postulate stating that electromag- 
netic energy can be localised in space. 


* It should be recalled that in mathematics an expression in the form 
AdS is called the flux of vector A through the surface S. 


theo 
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If Poynting’s vector really has the significance of energy flux it 
should be related to the energy density as follows: K = vow (cf. 
p. 1148 where an analogous problem is considered relative to the 
propagation of elastic waves in a medium). By means of Maxwell's 
theory, we can determine v, the propagation velocity of the electro- 
magnetic energy. It turns out that 


c 

Ven” 

Thus, in vacuum electromagnetic energy should be propagated at a 
velocity c = 3 x 10° cm/sec, which agrees excellently with expe- 
riment. The coincidence between the values of 
ec determined from purely electrodynamic expe- 
riments (e.g., the measurement of the interac- 
tion between two currents) and the value of this 
constant determined by direct measurement of 
the propagation velocity of electromagnetic 
waves is remarkable and may be taken as practi- 
cally conclusive proof of the validity of Maxwell's 
theory. 

In a medium, the value of the propagation ve- 
locity of an electromagnetic wave is c divided by 
V ep. We shall see below under what conditions 
this relationship is satisfied and it shall be 
explained why certain deviations occur. 

Let us now return to the consideration of 

Fig. 131 energy transformations in finite regions of space 
that contain conduction currents. 

Assume that in the region under investigation there exists a cylin- 
drical conductor of radius r through which a current of density 7 
flows. The intensity of the magnetic field at the surface of the conduc- 
tor (cf. p. 279) is equal in the Gaussian system of units to 7 = 2 rj 
and the magnetic flux lines form circles about the current axis. 
It can be seen (see Fig. 131) that Poynting’s vector is directed into 
the conductor, for the field intensity and the current vector have 
the same direction. As for the numerical value of Poynting’s vector, 
we obtain (at the surface of the conductor): 


v= 


¢ ant Es EG bee 
gq HHH 3 = Py an 


Now, let us determine the flux of Poynting’s vector in a conductor 
segment of length J. This flux is equal to 


72 2 
Kx dart = art = VY, 
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where V is the volume of the conductor segment. But / is simply the 


thermal energy released per unit volume of conductor. We have thus 
shown that the flux of the Poynting vector enters the conductor and 
transfers to it an amount of energy that is exactly equal to the energy 
expended in heat. 

Where does this flux come from? In exactly the same manner as 
above, it can be shown that the flux of energy comes from those por- 
tions of the conductor where applied forces are present. 

This picture explains how electromagnetic energy is propagated 
along conductors. When electric power is transmitted from Kuiby- 
shev to a consumer in Moscow, the energy is delivered by electro- 
magnetic waves and not by the first electrons initiating the motion 
along the conductor. 


Examples. 1. Let us determine the order of magnitude of the electromotive 
force produced in the antenna of a radio receiver located at a distance R = 
= 100 km from a transmitter whose power is P = 100 kw = 105 joules/sec. 

The numerical value of Poynting’s vector at the location of the receiving 
antenna is 

. P 105 


Cate ag -7 7 2 ratt 2), 
K Tak? Tn (108) 8x 10-7 joule/metres? sec (watt/metre?) 


In the Gaussian system of units, the Z and H vectors have the same dimen- 
sions (gm’/? cm—*/2 sec“). It can be shown that for an electromagnetic wave 
propagating in vacuum the numerical values of the £ and # vectors in the 
Gaussian system of units are equal: E’=H’. For these quantities, the following 
relationships exist between practical and Gaussian units: 


1 volt/metre =-> xX 10-4 statvolt/em; 1 amp/metre= 4x2 x 10-3 oersted. 


Then the numerical values of the Z and H vectors in the practical system 
of units are: 


E =e x 10-42’; H=4n x 10-30’. 


Therefore, for an electromagnetic wave (£’ = H’), we obtain: E = 120nH. 


M 


In the practical system K = EH; hence, oe and E=//1200K = 


1207 
= 1.7 X 10°? volt/metre. 
This means that the potential difference produced in a receiving antenna 
having a length of 1 metre is of the order of 20 mv. 
2. Compare the value obtained above for K with the value of the solar 
constant, i.e., the energy that would arrive from the Sun each second on 1 cm? 
of the Earth's surface if there were no losses.in the atmosphere: 


K gun =0.15 watt/om2 = 1,500 watts/metre?. 
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419 Momentum and Pressure of an Electromagnetic Field 


rding to the theory of relativily (see Pp. 423), matter which 

Acco energy also possesses mass. The relationship between mass 

Oss ese. is given by the equation EF = mc*, where c is the propaga- 

ane ay of light. As we already know, the energy of an electro- 

Hien tie field may be considered to have the following density 
hood 


distribution in space: 


1 Di 9 
w =< (eB? + pH), 


Thus, a unit volume of electromagnetic field possesses a mass of 


Ww 
m = oe’ 

Since moving matter possesses mass, it must also have a momen- 
tum equal to the product of the mass and the velocity of motion. 
We conclude, therefore, that a unit volume of electromagnetic field 
has a momentum 


w 
=m=—. 
g c 


This expression is appropriately called momentum density. 

As stated earlier (p. 319), since Poynting’s vector has the sig- 
nificance of energy flow, it must be related to the energy density in 
accordance with the formula K = we. Comparing the last two for- 
mulas, we see that the relationship between the momentum density 
and Poynting’s vector is given by the expression g = K/c®, where c 
is the velocity. 

Since a flow of electromagnetic radiation possesses mass and momen- 
tum, it will exert pressure on a surface placed in its path. The mag- 
nitude of this pressure may be expressed in terms of the momentum 
density and may vary depending on whether the surface absorbs or 
reflects the wave energy. Of course, intermediate cases are also pos- 
sible. 

In the time At, the electromagnetic field included in a volume 
ScAt strikes the surface S. If total absorption occurs, a momentum 
equal to gScAt is lost in this time. But momentum divided by time 
is force, and force divided by area is pressure. Hence, the pressure 
exerted on the surface absorbing electromagnetic energy is equal to 
p = ge, the product of the momentum density and the velocity of 


light, or, since g = = , the pressure is equal to the energy density w. 


Now, let us consider an ideal elastic encounter between the field 
and the surface. If all the energy of the electromagnetic field (wave) 
is reflected, the change in momentum will be twice the incident 
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momentum, for the latter has reversed its direction. Just as in the 
purely mechanical cases (p. 68), the force of an elastic impact is 
twice as large as the force of an inelastic impact. Hence, the pressure 
exerted by the wave on an ideally reflecting plate is 


p=2ge or p= 2w. 


The formula for the general case is now easily obtained. If the 
plate reflects part of the energy and the coefficient of reflection is 
equal lo p, the pressure of the electromagnetic flux (wave) is given 
by the expression 

p=w(l—p)+ 2pw= (1 -+p)w. 


Using light, P. N. Lebedev verified these formulas experimentally 
in 1900 and thus greatly contributed toward the development of our 
present conception of the nature of electromagnetic waves. The pres- 
sure of light is exceedingly small even for the most intense sources. 
For example, the pressure of light on a mirror located at a distance 
of 1 metre from a “lamp” of 4 million candle power is of the order 
of 10-* dyne/cm?. That is why Lebedev’s measurement of the pressure 
of light with an accuracy of 1-2% is viewed as a great experi- 
mental achievement. 

Basically, Lebedev’s apparatus consisted of a pair of vanes at- 
tached to a light-weight suspension. One vane was an excellent absorb- 
er of light and the other an excellent reflector. The light was dizected 
first at one vane and then at the other, and, then, by comparing the 
displacement angles, it was possible to determine the magnitude of 
the force. The chief difficulty was how to take into account the ef- 
fect on the vanes of residual gas heating in the vessel containing the 
suspension. 

As we have just seen, the theory of variable electromagnetic fields 
has led to the conception of a field as a physical reality (electromag- 
netic radiation). The great meril of Lebedev’s experiments is that 
they provided direct proof of the validity of this conception. 

An electromagnetic field possesses energy and momentum, is 
propagated in space wilh a specific velocity and exerts pressure on 
an obstacle. We shall see below (p. 560) that an electromagnetic 
field may be transformed into matter. All these facts taken together 
irrefutably prove that an electromagnetic field is a physical reality. 


‘ 


CHAPTER XVIII 


ELECTROMAGNETIC RADIATION 


120. Elementary Dipole 


Electromagnetic radiation occurs whenever a variable electromag- 
netic field is created in space. An electromagnetic field, in turn, 
varies in time whenever the distribution of electric charge in a system 
changes or the density of an electric current varies. 

Thus, every variable current and pulsating electric charge is 
a source of electromagnetic radiation. 

Magnetic and electric dipoles having variable moments—particu- 
larly the latter—are the simplest systems producing electromagnetic 
fields. A system consisting of a stationary positive charge about 
which a negative charge oscillates constitutes such an electric dipole. 
If the oscillation is sinusoidal, the dipole moment will also be sinus- 
oidal, i.e., it is represented by the formula p = py cos wt. This 
simple radiator model has very great significance since many real 
systems can be represented to a high degree of accuracy by ideal 
dipoles. 

It will be recalled (see Sec. 93) that the electric properties of 
a system whose “centres of gravity” of positive and negative charge 
do not coincide may be described in terms of the dipole moment of 
the system. But most radiators of electromagnetic energy are elec- 
trically neutral systems whose positive and negative charges are 
capable of being displaced relative to each other. This is primarily 
because atomic and molecular systems fall under this heading. An 
electron rotating about the nucleus of an atom is a system having 
a variable dipole moment, and a neutral molecule whose atoms are 
in a state of oscillation is also, frequently, a system having a variab- 
le dipole moment. However, our interest in the electric dipole extends 
further. In the following article, we shall see that a linear radio anten- 
na may be likened to a dipole. (Incidentally, the analogous terms 
“oscillator” and “vibrator” are somewhat broader in meaning than 
the exact term “dipole”.) 

Magnetic dipoles occur when the electric charge distribution and 
hence the dipole moment of the system remain unchanged while the 
current density and hence the magnetic moment of the system change 
with time. A typical example is a loop in which an alternating elec- 
tric current flows. If the current flows in a closed circuit, the electric 
charge is neither accumulated nor dissipated anywhere. The electric 
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dipole moment of such a loop equals zero and does not change. 
However, the loop’s magnetic field, which is related to the value 
of its magnetic moment, varies and, therefore, electromagnetic 
energy is radiated. It follows from the theory that if a system pos- 
sesses simultaneously an electric and a magnetic moment the radia- 
tion from the magnetic dipole at large distances from the source is 
usually much less than the radiation from the electric dipole. 

If a dipole radiates by giving up internal energy or, as in the case 

of an antenna, by transforming the energy of an external source 
into radiation energy, the dipole is called a primary radiator. How- 
ever, a secondary radiator is also of considerable interest. In this 
Case, a dipole is made to oscillate by the action of an electromagnet- 
ic wave and becomes a radiator only as a consequence of this action. 
Secondary oscillations are particularly intense when the primary 
wave is of the same frequency as the natural frequency of the dipole 
(resonance). 
' Setting a dipole into an oscillatory state may be viewed as a mechan- 
ical process—the jostling of the charges by an external force equal 
to the product of the charge and the field intensity. At the same time, 
the process of creating secondary oscillations in a receiving antenna 
may be viewed as an induction process in which an alternating 
electric current is produced by an alternating magnetic field. To the 
extent that the antenna may be replaced by a dipole, both views 
are equivalent. 
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An important difference exists between the state of oscillation of 
an oscillatory circuit (p. 313) and the oscillation of the current in an 
antenna. In discussing the electric oscillation of a circuit, we referred 
to a definite instantaneous current strength and a definite instan- 
taneous charge on the condenser plates. It was assumed that the cur- 
rent strength in all parts of the circuit was the same, that the elec- 
tric charge was concentrated on the condenser plates and, hence, at 
a given instant could only have a single value. 

The electric oscillation in the case of an antenna cannot. be viewed 
in the same manner as the oscillation of a pendulum. However, the 
Oscillation of an electric current in an antenna does have a mechan- 
ical analogue. This oscillation is very similar to the vibration of 
a rod or string, i.e., it can be represented by a standing wave. 

This may be strikingly demonstrated by showing that an excited 
antenna has current nodes and antinodes. A small bulb may serve 
as the current indicator (Fig. 132). It turns out that a conduction 
current antinode exists at the centre of a free section of line in which 


electromagnetic waves are excited and that conduction current 
21* 
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anodes exist at the ends. In such a line, the current at all points is 
unidirectional at each instant. At some instant the current al each 
point decreases to zero and then begins Lo flow in the opposite direc- 
tion. The electric charge, which is distributed continuously along 
the line, varies accordingly. Clearly, as long as current flows in one 
direction, posilive charge is accumulated on one half of the line 
and negative charge is formed on the other. When the current decreases 
to zero, the charges at the ends are a maximum and of opposite 
sign. The current then begins to flow in the opposite direction and 
the charges decrease, becoming zero when the current strength in all 


Fig. 132 


parts of the line is a maximum. At this instant, the recharging proc- 
ess commences, charges of opposite sign are accumulated on the 
two halves of the line, etc. 


The reader will note that at each instant charges ‘tf opposite sign 
are located on the two halves of the line. Two charges that are equal 
in magnilude but opposite in sign, and separated by acertain distance, 
constitute an clectric dipole. It can he ‘stated, therefore, that the 
electric oscillations of an antenna are very similar to the oscilla- 
tions of an electric dipole in which the dipole moment decreases 
from a maximum positive value to zero, then increases in the oppo- 
site direction, then again decreases, etc. 

The field of an antenna differs from that of a dipole only in the 
region close to the antenna. At distances hundreds of times greater 
than the dimensions of the antenna, the field created by the antenna 
does not differ from the field created by an ideal electric dipole. 

Let us again return to the analogy between an antenna and a rod. 
The natural frequencies of electric oscillations that can exist in 
free, ungrounded antennas are not restricted to the frequencies deter- 
mined from the simplest case, i.e., when a half wavelength is impressed 
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on the length of the antenna, although such half-wavelength di- 
poles are mainly used in UHF engineering. The following relationship 
exists between the length of the antenna and the wavelength: 


Ls n< (sce p. 131). Thus, an antenna of length Z can receive and 


radiate waves of wavelength 4 satisfying the above relationship. 

In the field of radio, a number of methods exist for varying the 
natural frequencies of an antenna. Basically, they consist in the 
connection of a self-inductance coil or a condenser to the antenna. 
By varying the inductance or capacitance, the natural frequencies 
of the antenna may be varied within broad limits. 
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The radiation pattern of a dipole may be determined experimental- 
ly. It turns out that the results are in complete accord with the theo- 
ry first advanced by Hertz. We shall be concerned only with the 
results of experiments and theoretical calculations, restricting our- 
Selves to the field far removed from the dipole, i.e., to the so-called 
wave zone. This is the region in which the distances to the dipole 
are considerably greater than the dipole dimensions. 

Irrespective of the complexity of dipole oscillations, the oscilla- 
tions may always be resolved by means of Fourier’s theorem into 
their spectra, i.e., they may be represented as the sum of harmon- 
ic oscillations of frequencies w, 2, 3, etc. Therefore, it is quite 
sufficient to consider the electromagnetic field of a dipole whose 
moment varies in accordance with the harmonic relation p = 
= Po cos wl. 

Calculations and experiments show that the field of such a system 
may be represented by a spherical wave propagating with a velocity 


c . . 
v =. ——. The electric and magnetic vectors of the wave are at 


EL 
right angles to each other and also at right angles to the direction of 
propagation. The latter circumstance, incidentally, follows from 
Poynting’s theorem. 

In the wave zone, the electric and magnetic vectors vary in phase, 
performing harmonic oscillations at every point in space. A simple 
relationship exists between the numerical values of the field inten- 
sity vectors, namely: 


VeE=V ope. 


Hence, Poynting’s vector may be written in the following form: 


Ka Ge tater. 


ft AT 


326 ‘Electromagnetic Radiation 


Thus, the wave intensity, i.e., the energy passing through unit 
area per unit time, is proportional to the amplitude squared of the 
electric field intensity. _ 

The radiation of a dipole is not the same in all directions. The 
amplitude, as well as the intensity, depends on the ‘angle of inclina- 
tion of the propagation direction to the axis of the dipole. In the 
direction perpendicular to the dipole axis the radiation is a maxi- 
mum, while in the direction of the dipole moment it is equal to 
zero. Theory gives us the following expression for the electric field 
intensity: 

Pow? 
E— c2R 
where the factor before the cosine is the wave amplitude of vector 
E. The angle @ is the angle between the propagation direction 
and the dipole axis. The expression for the 
magnetic field deviates from the above 
only with respect to a slight difference in 
the amplitude factor. 
Fig. 133 shows a diagram sometimes used 
to represent the dependence of radiation 
intensity on direction. Here, a radius vec- 


sin 6 cos w (2 — 4) 5 


Af2 


Fig. 133 Fig. 134 


tor is shown intersecting the radiation pattern. If the scale is known, 
the radiation intensity is given by the length of the vector meas- 
ured to the point of intersection. 

The fact that the amplitude is proportional to the radiation fre- 
quency, squared is very important. Clearly, the radiation intensity 
of the dipole depends very greatly on the frequency, i.e., it is pro- 
portional to the frequency to the fourth power: 


ot . 4 
~ =; sin? 0. 
K Te 8 
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Thus, when the frequency is halved, the intensity decreases to a 


of its original value. 
Theory has led to an important conclusion regarding the orthogo- 
‘nality of an electromagnetic wave. This is illustrated in Fig. 134, 
where it is seen that the electric and magnetic vectors are perpendicu- 
lar to the direction of propagation. As.a result, the properties of an 
electromagnetic wave change when the wave is turned about the 
direction of propagation. This phenomenon is known as polarisation. 
Mapping the flux lines of a radiat- 
ing dipole is of no particular inter- Ses 
est. Fig. 135 shows vectors of elec- 
tric field intensity for several points 
in space. The field is rotational and Sas 
the flux lines are closed. When radia- W\ 
tion occurs, the close lines expand in 
the direction away from the radiator. NY 
The magnetic lines of force consist of 
circles about the dipole axis. At great A 
distances from the dipole, tlre spheri- 5 II Z ttt 
cal wave practically does not differ 
from a plane wave. Of course, the 
orientation of the Z, MH and K vectors Y 
in a plane wave and the numerical if 


relations given above remain the same. Me Uf 
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According to theory, electromagnetic or 
radiation occurs when electric charges Fig. 135 
are accelerated nonuniformly. A uni- ‘ 
form or free flow of electric charge does not produce radiation. 
Charges moving under the action of aconstant force, e.g., charges 
describing a circle in a magnetic field, also do not radiate. 

In oscillatory motion, the acceleration is continuously changing. 
Hence, electric charge oscillations produce electromagnetic radia- 
tion. Electromagnetic radiation also occurs when charges are abrupt- 
ly decelerated. Thus, when a beam of electrons impinges on a tar- 
get, X-rays are produced. Electromagnetic radiation also occurs 
during random thermal motion of particles (thermal radiation). 
The pulsations of a nuclear charge produce an electromagnetic 
radiation known as y-rays. Ultraviolet rays and visible light are 
produced by the motion of atomic electrons. Electric charge oscilla- 
tion on a cosmic scale is exemplified by the radiation of radio waves 
by heavenly bodies. 
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In addition to natural processes in which various kinds of electro- 
magnetic radiation are produced, a number of experimental means 
exist for creating electromagnetic radiation. 

The main characteristic of electromagnetic radiation is ils fre- 
quency (in the case of a harmonic oscillation) or its frequency band. 
Of course, using the relation .c = vA, the length of the electromag- 
netic wave in vacuum may be determined if the radiation frequency 
is known. 

The radiation intensity is proportional to the frequency to the 
fourth power. Hence, very low-frequency radiation having wave- 
lengths of the order of hundreds of kilometres cannot be traced. 
Practically, the radio band begins with wavelengths of the order 
of 1 or 2km, which corresponds to frequencies of the order of 150 kc/s- 
Wavelengths of the order of 200 metres are in the medium-frequency 
band, while those of the order of tens of metres are in the shorl- 
wave band. Ultrahigh frequencies (UHF) are beyond the usual radio 
frequencies; wavelengths of the order of several metres and fractions 
of a metre, down to a centimetre (i.e., frequencies of the order of 
10*°-10"! me/s), are used in the television field and in radar. 

In 1924, Glagolyeva-Arkadyeva obtained even shorter electromag- 
netic waves. Electric sparks produced between iron fillings suspend- 
ed in oil served as her source. In this manner, wavelengths down to 
0.4 mm were obtained. Thus, an overlap was achieved with thermal 
radiation wavelengths. 

Visible light occupies a very small band of wavelengths—from 
7.6 x 107% cm to 4 x 10-5 cm. This band is followed by ultraviolet 
rays, which are invisible but very easily detected by means of labo- 
noe equipment. These wavelengths extend from 4 x 10-5 cm to 

~° em. 

Following the ultraviolet band is the X-ray band. The wavelengths 
in this band extend from 10-° cm to 10-° cm. The shorter the X-ray 
wavelength, the less the absorption by matter. Electromagnetic 
radiations of shortest wavelength, which are the most penetiating, 
are called y-rays (wavelengths of 10-® cm and less). 

The nature of any of the enumerated electromagnetic radiations 
may be completely determined as follows: First, the electromagnetic 
radiation is resolved into a spectrum by some method or other. In 
the case of light, ultraviolet rays and infrared radiation, this may be 
accomplished by refraction through a prism or by passing the radia- 
tion through a diffraction grating (see below). In the case of X-rays 
and y-rays, resolution into a spectrum is achieved by reflection from 
a crystal (see p. 385). The spectrum of radio waves is determined by 
making use of the phenomenon of resonance. 

The radiation spectrum obtained may be continuous or discrete, 
i.e., all frequencies may be present in a broad band of the radia- 
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Lion spectrum or Lhe spectrum may consist of individual sharp lines 
corresponding to very narrow bands of frequency. In the first case 
the speclrum is represented by a curve of intensity vs. frequency (or 
wavelength), while in the second case the spectrum is described by 
giving the frequency and intensity of the lines. 

Experiments show that electromagnetic radiation of given frequen- 
cy and intensity may not always have the same polarisation state. 
In addition to radiation in which the electric vector of the waves 
oscillates along a specific line (linearly polarised waves), radiation 
also exists in which linearly polarised waves turned relative to each 
other about the beam axes are superimposed. Hence, to completely 
describe radiation, it is also necessary to indicate its polarisation. 

It should be noted that even for the slowest electromagnetic oscilla- 
tions, the electric and magnetic vectors of a wave cannot be meas- 
ured. The above descriptions of a field are based on theory. Neverthe- 
less, in view of the continuity and unity of all electromagnetic 
theory, there is no reason to doubt their veracity. 

The assertion that one or another kind of radiation consists of 
electromagnetic waves is always based on indirect evidence. How- 
ever, since these hypotheses have so many consequences that are in 
complete mutual agreement, the electromagnetic spectrum hy- 
pothesis long ago became accepted fact. 


124. Quantum Nature of Radiation 


We have already noted (p. 156) that investigation of atomic 
phenomena has led to a law which states that the internal energy 
of a system cannot assume any arbitrary value, but is characterised 
instead by a system of energy levels. Energy radiation is related 
to the transition of a system from a higher level to a lower level. 
Inergy absorption is related to the transition to a higher level. 

This applies, in the first place, to electromagnetic radiation. The 
quantum nature of submicroscopic phenomena was discovered at the 
beginning of this century as a result of investigation of a number of 
conflicting facts regarding electromagnetic radiation. 

Thus, the emission of electromagnetic radiation of frequency v 
by a dipole occurs in quanta (packets) rather than continuously. 
A quantum of energy is equal to kv, where h is Planck’s constant and 
is equal to 6.62 « 10-* erg sec. 

The quantum nature of electromagnetic waves is manifested in 
absorption as well as radiation, for absorption too can only occur 
by means of energy quanta. If the value of an energy quantum is 
equal to the difference between certain energy levels of a system on 
which the wave impinges, the absorption process is quite pronounced. 
Such a process may be called resonance absorption. From the stand- 
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point of classical physics, such absorption occurs when the frequency 
of the external field is equal to the oscillation frequency of the par- 
ticles constituting the system. If the value of the electromagnetic 
wave quantum is less than the difference between energy levels. 
absorption cannot occur and the wave passes freely through the 
system. 

In quantum terms, the secondary radiation of a system is described 
as follows: A system absorbs a quantum of electromagnetic energy 
and is raised to a higher energy level. The system maintains this 
level for a certain period of time and then returns to its former 
energy level by giving up energy—again in the form of a quantum. 

Since a quantum of energy is equal to hv, it is immediately evident 
that the higher the radiation frequency, the more pronounced the 
quantum phenomena. Nevertheless, the quantum nature of radiation 
has already been observed in practically all regions of the electro- 
magnetic spectrum. It has even been possible to observe the quantum 
absorption of radio waves having wavelengths of several hundred 
metres (radiospectroscopy—p. 549). 

The presence of one or another spectrum of electromagnetic radia- 
tion depends, in the first place, on the arrangement of energy levels 
in the system under consideration and on the transition probabili- 
ties of the system from an n-th level to an m-th level. If these proba- 
bilities were known beforehand and the energy level diagram were 
available, it would be an easy task to determine the radiation spec- 
trum of the system. 

We shall repeatedly be dealing with problems of radiation and 
absorption of electromagnetic energy, but now let us consider some 
problems of electromagnetic wave propagation in which the quantum 
nature of radiation is not manifested when the phenomenon is not 
accompanied by the absorption and radiation of energy. 


CHAPTER XIX 


PROPAGATION OF ELECTROMAGNETIC WAVES 


125. Dispersion and Absorption 


In a homogeneous medium, the velocity of propagation and the 
direction of an electromagnetic wave do not change. The velocity 
of the wave is a maximum in vacuum. In a medium, the wave velo- 
city is 

a a 

Ven | 

and since in most practical cases jp = 1, we obtain 


v= 


c 
Veo 
The ratio of the velocity of wave propagation in vacuum to the 
velocity of propagation ina medium is called the index of refraction. 


Thus, from electromagnetic theory we obtain the equation x = Ve, 
which is quite valid for very long wavelengths. As the wavelength 
changes, the index of refraction changes. This dispersion is alien to 
Maxwell’s electromagnetic theory, which regards a medium as a con- 
tinuum and does not take into account the interaction of radia- 
tion and matter. Be that as it may, the equation n = Ve is not 
valid for rapid electromagnetic’ oscillations. 

When an electromagnetic wave is propagated through matter, the 
electric charges of the molecules are set into a vibratory state. Since 
an electron cloud moves freely as compared with heavy nuclei, 
electric oscillation consists in the displacement of the centre of 
gravity of the electrons relative to the stationary centre of gravity 
of the atomic nuclei’s positive charges. Designating the charge and 
mass of the oscillating electrons by e and m, respectively, the oscil- 
lation equation may be written in the form 


mx= —kx—eEy cos ot. 
Or, dividing by m and using the formula for the natural frequency of 
2 : , e k a 
oscillation, i.e., o; = —, we obtain 


m 


e 
r= —otx —— Ey cos wl. 
0 m 
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We have equated the product of mass and acceleration lo two forces— 
the restoring force—kz and the external, periodically varying force 
eE,) cos wt. This is the equation of forced harmonic oscillations. 
It is satisfied if 


= Xp COS Wl. 


After substituling in the equation, we obtain 


——— 50 


m 


XL) = —-—— 
0 OF — w2 


The dipole moment of a molecule is 


The polarisation vector, i.e., the dipole moment per unit volume, iS 
N times larger, where NV is the number of molecules per unit volume: 
Ne? 
pP=—,”_, E. 


5 — w? 


ve ecalling the formula relating the polarisation to the field inlen- 
sity, i.e., 


pa 2s, 
40 
it is seen that the permittivity of the medium has been expressed 
in terms of the parameters of the molecular dipole: 


AnN e2 
m 
e=1+4+——. 


wz — w2 


, 


_ The index of refraction of the medium should be equal to the square 
root of this expression. 

As shown in Fig. 136, the nature of the dependence is confirmed 
by experiment. Here, the index of refraction vs. frequency curve, 
based on the above formula, is compared with the curve based on 
measurements with specific substances.* What is the basic conclu- 
sion to be drawn from the experimental and theoretical results? In 
general, the index of refraction increases with increasing frequency 
in the entire frequency range, except for the region in the immediate 
vicinity of resonance absorption. This region is called the anomalous 


* A more exact theory results in agreement with the experimental values 
in the region close Lo wo as well. 
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dispersion region. A substance may have several resonance frequen- 
cies, which correspond to the differences between its energy levels. 
Hence, there will be a corresponding number of anomalous disper- 
sion regions. 

Thus, the index of refraction of a wave, and hence the velocity 
of propagation, greatly depends on the value of the wave frequen- 
cy relative to the natural 


Trequencies of the molecular : 
dipoles. n i ‘ 
Naturally, the capacity ' -——- Meory 


of a substance to absorb an —— Experiment 


eleclromagnelic wave de- 
pends on the same factors. 
Using the same reasoning 
as in the case of elastic 
waves (see p. 120), we arrive 
ata completely analogous 
formula: 


T=JIpe-!4, - 


which enables us to deter- 
mine the valueof the radia- 
Lion intensity J relative to Ww, 

the incident intensity J if : : 

the absortion coefficient pu Fig. 136 

and the layer thickness d 

through which the wave has passed are known. It should be recalled 
that the absorption coefficient is equal to the reciprocal of the layer 


thickness which decreases the radiation intensity to ~~ of ils original 


value. Due to the complex system of energy levels peculiar to mat- 
ter, the curve of the absorption coefficient plotted against the 
frequency of the incident wave may appear odd and “erratic”. 
Until now we have been considering dielectric media containing 
only bound electric charges. Other relations exist when an electro- 
magnetic wave is propagated in a medium in which a considerable 
number of free electrons are present. Such media include metals and 
the ionosphere—a region of free charges akin to a gas. Using the 
theory presented above, it-must be assumed that in the formula for 
¢ the nalural frequency Wo of a free charge is equal to zero (the fre- 
quency is proportional to the rigidity of the bond). The dielectric 
constant is then given by the formula 
AstN e2 
m 
w2 


ex=1— 


. 
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When w becomes sufficiently large, the index of refraction n = Ve 
approaches unity. But when w? < 4nNe*/m, the index of refraction 
is imaginary. This means that for the given values of frequency the 
waves cannot penetrate the metal or the ionosphere. On the other 
hand, for high frequencies the waves are “indifferent” to the presence 
of a medium containing electrons. These predictions are borne out in 
the case of radio waves. Thus, long and medium waves are reflected 
from the ionosphere and do not penetrate it, short waves penetrate 
ve ta and uhf waves pass through the ionosphere unim- 
peded. 

The above presentation is greatly oversimplified. Hence, it should 
not be surprising that the conclusions are not valid for the optical 
region where the values of the index of refraction may be close to 
zero and also much greater than unity. 


126. Behaviour of an Electromagnetic Wave at 
the Boundary Between Two Media 


Just as in the case of an elastic wave, an electromagnetic wave is 
reflected and refracted at the boundary between two media. The basic 
F y laws of these phenomena 

may be subjected to theo- 
retical analysis by utilising 


4 the boundary conditions om 

a the electromagnetic field 
TET PT veclors. These conditions, 
wave wave discussed on pp. 259 and 


291, follow in turn from 

; Maxwell’s equations. Since 

Fig. 137 the relationships between 

the fields on either side of 

the boundary are not arbitrary, the division of the wave into reflect- 
ed and transmitted components is also not arbitrary. 

The two fundamental relations may be expressed as follows: the 
tangential components of the electric and magnetic vectors on either 
side of.a boundary must be equal. 

What restrictions are imposed by these relations in the simple 
case of normal incidence? This case is illustrated in Fig. 137. Assum- 
ing that the electric vectors are in the plane of the page, then the 
magnetic vectors are perpendicular to this plane. We know that the 
electric and magnetic vectors and the direction of propagation may 
be viewed as a right-handed screw system, i.e., the rotation of 
vector E by the shortest path toward vector H appears counterclock- 
wise to one facing the oncoming wave. To satisfy this requirement of 
electromagnetic theory, the direction of either vector H or vector E 
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must be reversed for the reflected wave. Thus, either the magnetic 
or the clectric vector undergoes a 180° phase shift when the wave 
is reflected. 

When considering oblique incidence, it is necessary to determine 
which of the two actually occurs. It turns out that both are possi-. 
ble—one when the wave passes into a medium of larger e« and the 
other when it passes into a medium of smaller ce. 

For normal incidence, the following calculations do not depend on 
the scheme chosen. Let us write the boundary conditions in the-form 


Eincia = Exepiect + Erefract 
and 
Hincia aaa Treftect + oer 


But the following relationship exists between.the numerical values 
of the H and E vectors: 


H=VeE=nE. 
Hence, we obtain two equations, 
Eincia = Eveftect -¢ Erejract 
and 
NE incia = — ME pe frect + N2E rejruct» 

whence the ratios Eyefiec:/Eincia and Erefract/Eincia May be deter- 
mined. Since the wave intensity is proportional to the amplitude 
squared and the index of refraction (p. 326), we obtain for the coeffi- 
cients of reflection and transmission the following simple formulas, 


n ‘6 . «. 
where n=—2 is the relative index: 
1 


Eveflect ) 2 = (x — 1)? 

incid (n+ 1) 

Evefract ) 2 ee 4n 
Eineid (n+ 4)? * 


coefficient of reflection = ( 


coefficient of transmission = ( 


The similarity to the case of elastic waves (cf. p. 128) is very great. 

By means of such calculations, the general results presented below 
were obtained for the oase of arbitrary beam inclination and polari- 
sation state of the wave. The agreement with experimental results is 
quite satisfactory. 

Since the sum of the reflection and transmission coefficients is 
equal to unity, the theoretical results are completely described by 
Fig. 138, where the intensity of the reflected wave is plotted as a 
function of the angle of incidence. 

Calculations and experiments show that the nature of the reflection 
curve depends to a significant extent on the polarisation state of the 
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incident wave relative to the plane of incidence. The electric field 
intensity vector E is “more important” than the Hf vector, if only in 
the sense that the photochemical action is due to E. Therefore, in 
describing the polarisation state of a wave, it is cusLomary to specily 
it with respect to the electric vector. The orientation of the H vector 
is always easily found if the direction of propagation is known. Thus, 
it turns out that the reflection coefficient is different for two waves 
that are incident at one and the 
same angle @ on the same boundary 
if in one case the electric vector 
isin the plane of incidence and in 
the other it is perpendicular to 
this plane. In the figure, curve 
corresponds to the case when the £ 
vector is perpendicular to the plane 
of incidence, curve JIJ corresponds 
to the case when the £ vector is in 
the plane of incidence and curve JJ 
corresponds to the case when the 
wave is not polarised. 

In the first case, the change in 
the reflection coefficient is monoton- 
ic—for normal incidence there is 
‘little reflection, the coefficient 
being of the order of 5%; then, 
with increasing angle the reflection 
coefficient increases, ever more rap- 
idly, until the glancing angle is 
reached. A beam whose electric vector is in the plane of incidence 
behaves in an entirely different manner. Its reflection intensity 
decreases with increasing angle until it reaches zero at the angle @p- 
This angle is determined by the following interesting equation: 
n = tan @g. The figure is plotted for the value n = 1.52 (transition 
from air to glass). Hence, the angle at which the reflection coefficient 
decreases to zero is equal to 56/40’. For a further increase in angle, 
the coefficient of reflection begins to increase and finally reaches 
unity. 

What is the reason for the absence of reflection in this particular 
case? How does this case differ from others? Evidently, the answer 
must be sought in the boundary conditions, from which the entire 
theory of the phenomenon proceeds. We leave it to the reader to 
construct the field vectors for this angle and illustrate the require- 
ment. ‘ 

The following question may arise in the reader’s mind: If the 
boundary conditions enable us to understand all phenomena at the 
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boundary between two media, then what about total internal reflec- 
tion where there is a field in one medium (£y; =4 0) but none in the 
other? The question is perfectly valid and the theory provides an 
answer. It turns out that under conditions of total internal reflec- 
tion the field penetrates the second medium but is not propagated 
deeply into the medium. The condition Fy, = Ey, is not violated. 

A number of experiments have been devised for the demonstration 
of light wave penetration into a second medium under conditions of 
total internal reflection. Suffice it to recall the basically simple 
experiment proposed by Mandelshtam. A glass prism is partially 
immersed in a solution of fluorescein—a substance exhibiting 
a characteristic fluorescence under the action of light. Then, a beam 
of light is directed onto the prism is such a manner that total reflec- 
tion occurs on the inner side of the prism surface that is immersed 
in the solution. The fluorescein thereupon glows intensely in an 
extremely thin layer next to the glass, proving that the electromag- 
netic wave has penetrated the solution. 


127. Natural and Polarised Light. Polarisation 
Upon Reflection 


Place a glass plate at an angle pz to a light beam. The beam is 
reflected. Then, if the beam is turned about its axis (actually, the. 
source of light is turned about the beam axis), it might be expected 
that at some position the beam will not be reflected. But if natural 
light is used for the experiment, this does not occur, i.e., for every 
azimuthal position of the incident beam, the reflected beam has the 
same intensity. It would be wrong to consider this a refutation of the 
theory presented in the preceding article. This experiment merely 
shows that the polarised state of a beam of natural light is more 
complex than given by the scheme of two vectors, # and H, having 
fixed directions of oscillation. 

Now, let the above beam, reflected at an angle gg, impinge on a 
second plate placed at a similar angle mz to the beam reflected from 
the first plate. Then, turn the beam about its axis. Since, of course, 
only the relative position of the beam and the reflector is of impor- 
tance, it is easier to turn the second glass plate. Investigation of this 
double reflection shows that the reflection varies with the position, 
and the position for which no reflection occurs is easily found. It is 
evident that this position corresponds to a mutual orientation of 
beam and reflector for which the electric vector of the beam is in 
the plane of incidence. The following conclusion may be drawn: 
reflection from the first reflector results in the natural beam acquir- 
ing a polarised state in which a single oscillation direction of the 
electric vector is separated out. 
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In contradistinction to natural beams, beams in which the vectors 
have a specific oscillation direction are referred to as polarised beams. 
How should the polarised state of a natural beam be envisaged? 
It is necessary to assume that in a natural electromagnetic wave all 
possible oscillation directions of the electric vector are uniformly 
present. The word “possible” should be underlined since electromag- 
netic theory shows that the electric vector is of a transverse nature. 
In essence, therefore, a natural unpolarised wave is a superposition 
of numerous linearly polarised waves having a uniform distribution 
as regards the vector oscillation directions. All transverse directions 
are electric vector oscillation directions of a beam of natural light. 

Reflection from two successive reflectors, fixed at an angle my, to 
the beams, is one method of polarising beams of light. 

An electric vector of natural light may always be resolved into two 
mutually perpendicular components. When reflection is being inves- 
tigated, it is most convenient to resolve each vector into two com- 
ponents—one in the plane of incidence and the other perpendicular 
to this plane. Thus, the behaviour of a natural beam may be equated 
to the behaviour of two such component waves, if we take into ac- 
count that the phase difference between them varies randomly. There- 
fore, in describing the polarisation of light, we say that one of the 
components has not been transmitted, or has been transmitted. to 
such and such an extent. If upon reflection or refraction one of the 
components of light is transmitted to a larger extent than the other— 
which the reflection curves show to be the case—this signifies that 
the light has been partially polarised. 

We can utilise this phenomenon to obtain total polarisation of a 
beam. Instead of using two reflectors fixed at an angle @, to the 
beams, it is much easier to transmit a beam through a pack of glass 
plates. Each refraction will increase the share of one of the compo- 
nents in the beam by a certain percentage. In this manner almost 
total polarisation may be achieved. 

The natural state of a light beam is unpolarised. However, this 
does not mean that every beam that has not been subjected to rellec- 
tion or refraction is unpolarised. This applies particularly to radio 
waves. The short electromagnetic waves used in the transmission of 
television are highly polarised. It is precisely this circumstance that 
enables us to determine the direction of the transmitter by the orien- 
tation of the receiving antenna. The electromagnetic waves which act 
as the carrier of a television programme are highly polarised. Hence, 
the antenna must be oriented in such a manner that the oscillation 
direction of the electric vector coincides with the antenna direction. 
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128. Propagation of Light Waves in a Medium Having 
a Refractive Index Gradient 


As arule, a difference in density is associated with a difference in 
refractive index. The natural question arises: What is the nature of 
the wave propagation in a medium in which the value of the refrac- 
tive index varies from point to point, i.e., the refractive index gradi- 
ent differs from zero? 

A difference in refractive index signifies a difference in the veloci- 
ty with which the wave front advances. It follows, therefore, that 
the wave front will be continuously deformed as it advances in such 
a medium. If we construct the normals to the wave front, we obtain 
a curved line. Thus, it can be stated that in a nonhomogeneous medi- 
um light is propagated curvilinearly rather than rectilinearly. 

The analogous problem for sound waves was discussed earlier 
(p. 147). The same laws are applicable here and the beam path is 
also determined by Fermat’s principle. A beam of light propagated 
in a finite medium having a refractive index gradient follows a path 
between two points that requires a minimum amount of time to 
traverse. Therefore, the beam of light bends so as to shorten its path 
in regions where the refractive index is large and lengthen its path 
in regions where the refractive index is small. 

The best example of the propagation of light in a medium of gradi- 
ent 7 is the passage of a beam of light through the Earth’s atmosphere. 
Since the density and the index of refraction of air decrease with 
increasing elevation, it follows that refraction occurs in the atmos- 
phere. A beam travelling from a star to the Earth and entering the 
atmosphere at an angle rather than along a radius will bend; hence, 
the apparent position of the star is displaced relative to its true posi- 
tion. For a star at the zenith, the displacement angle is as much as 
'/, of a degree. 

Mirages are caused by the presence of a refractive index gradient 
in the atmosphere. They occur in the African deserts due to the fact 
that heat currents are easily formed above the hot sand, resulting in 
temperature gradients and, hence, density and refractive index gra- 
dients. As a result, the light beams travel along curved lines and 
a landscape seems to appear where the observer, accustomed to the 
rectilinear propagation of light, conceives it. 

Of course, in the case of light propagation in a nonlromogeneous 
medium, the waves are neither spherical nor planar. It should be 
recalled that a variable propagation velocity signifies that the wave- 
length also varies from point to point. What is the equation of wave 
motion in a medium where the refractive index changes from point 
to point? Since the parameters of the wave change from point to 
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point. the equation being sought must be a differential equation, for 
only a differential equation can express the dependence between the 
physical quantities for a given point in space. 

This equation may be found by means of Maxwell’s equations. 
Since the derivation is rather complex, it will not be presented here. 
The calculations yield the following relation, which is valid for the 
E vector (or its projection) as well as the Ht vector (or its projection): 

aay 1 024 


ds? 2 te * 


The function ¥ is called the wave function. It represents the E 
vector, the H vector, or their components since the equations are the 
same for each case. Here, s is the coordinate in the direction of wave 
propagation, ¢ is the time and v is the propagation velocity. 

This equation is called the wave equation and it is valid for all 
points in space located outside the sources of the field, i.e., outside 
charged regions and regions in which electric currents flow. 

First it will be shown that the above differential equation is satis- 
fied by the simplest wave process, i.e., a plane wave. As we know 


(p. 114), the expression for a plane wave of frequency w, propagating 
in the direction s, has the form 


- ¥ = Acoso (¢—=) : 


Let us determine the second derivative of the wave function ¥ with 
respect to time and with respect to the coordinate. We obtain 


ay nL 
ee haa ec 8 


It is seen that the following relationship must exist between the 
second derivatives: " 


ay 1 ay. 


ds? yp ez? 
hence, the equation of a plane wave is embodied in the proposed 
differential equation. However, the above differential equation em- 
braces much more. Any function of argument (:—=) is a solution 


of the equation since for any function V (¢—<) the derivative 
expressions in WY are the same. 
The dependence of a function on the argument («—=) is regarded 


as the sole indication of a wave process. The significance of this 
argument consists in the following: If the state at a point s = 0 
is characterised at the instant of time t = 0 by a certain value of the 
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wave function, then the same state occurs at point s,; at the instant 
of time ¢, =-t , at point so at the instant of time t= , etc. Here 


sis the coordinate reading along any rectilinear or curvilinear path. 
The differential equation 


is the general equation of a wave process and is valid,for any medi- 
um, including nonhomogeneous medium where v varies from point 
to point. : 

If it is necessary to express the wave function in terms of the three 
space coordinates zx, y, z, the generalised wave equation has the 
following form: 

ay , aw ew 1 ey 


Ox? oy? dz? v2 ae 


~ The sum of the second partial derivatives of a function is concisely 
designated by the symbol AW (read: Laplacian of VY). Thus, 


The differential equation of a wave is valid for any process in 
which the value of the wavelength and the amplitude of the wave 
vary from point to point. 

Let us designate the amplitude of the wave function ¥ by yp. For 
most problems, it is primarily p that interests us. If a vibratory 
process of frequency occurs in a region, then, in the most general 
case: 


Therefore, a wave function will always satisfy the equation 


pS SOs 
Uv 


2 


The part of the expression for Y that is a function of time always 
cancels in such an equation. Hence, the last equation is the equation 


for the wave amplitude . By means of the relation A= ye it may 
also be written in the form 
Ap +s p=0 
brarb=90. 


Sometimes this equation too is called the wave equation. 
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129. Propagation of Radio Waves 


Radio waves are propagated in accordance with the laws of reflec- 
tion and refraction. In order to obtain the concrete results required 
for our discussion, it is merely necessary to generalise the theory to 
the case of a medium having a continuously varying coefficient of 
refraction. But this has already been done for elastic waves (see 
p. 145) and is completely applicable for electromagnetic waves, 
i.e., for light as well as radio waves. A wave travelling in a medium 
of variable n, i.e., a wave travelling with variable velocity, is pro- 
pagated in such a manner that the least amount of time is taken to 
traverse the distance between two points. The path of the wave will 
be curvilinear and in passing from one layer of the medium to another 
layer where n is greater the wave will be deflected toward the normal 
to the boundary. 

In order to determine the nature of the radio-wave propagation, 
the electrical properties of the Earth and the atmosphere must be 
known. The electromagnetic field of the wave is greatly affected by 
the magnitudes of the electrical conductivity and the dielectric 
constant of these two media. 

How is the difference in behaviour of electromagnetic waves of 
different length explained? Of course, a significant role is played by 
dispersion. But an approximate indication of the behaviour of an 
electromagnetic wave may be obtained from an examination of the 
relationship between the displacement current and the conduction 
current. It is evident that a medium exhibits dielectric properties 
when the displacement current is much greater than the conduction 
current. On the other hand, if the displacement current is negli- 
gible, the medium may be considered to be a conductor. 

The properties of the Earth’s surface and the properties of the 
atmosphere must be examined from this standpoint. 

Let us take a typical example. Experience in the field of radio 
engineering has shown that a flat terrain covered with trees may be 
characterised by a dielectric constant ¢ of the order of 12 and a spe- 
cific electric conductivity y of 7 x 10° (in the Gaussian system). 
To study the propagation of waves over the surface of a sea, it is 
important to know the values of « and y for sea water. These values 
are 80 and 10", respectively. The ratio of the conduction current 
density to the displacement current density (see p. 319 for the re- 
quired formulas) is given by the formula 

Jeond _ Bi os 0.7 >< 10-10 YA 

Jdispl c €& & 
in the Gaussian system of units. For long waves, say 2,000 metres, 
this ratio is equal to 77 for a wooded area and to 1,600 for a sea 
surface. The medium in each case, but especially in the latter, may 
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be considered to be a good conductor. For short waves, say 20 me- 
tres, the first value decreases to 0.77 and the second to 16. This means 
that for short waves sea water continues to be basically a conducting 
medium, while a wooded area acts to a considerable extent as a 
dielectric. 

Waves propagating over a conducting surface “cling” to this sur- 
face. The electric flux lines approach the Earth at right angles and 
travel along the terrestrial surface. That is why an electromagnetic 
wave can easily travel around the globe. (It takes 0.13 sec to do this 


Earth 


Fig. 139 


and since this time can be measured quite accurately we can deter- 
mine the propagation velocity of radio waves.) This applies to long 
waves. Short waves cling only to sea surfaces. In other regions, they 
behave like perfectly free waves. When such a wave travels along 
the surface of the globe, it penetrates the Earth and is absorbed. 
Moreover, the higher the oscillation frequencies, the greater the 
absorption. 

A number of remarkable features in the behaviour of radio waves 
is explained by the presence in the upper layers of the atmosphere of 
a layer containing a large number of free ions and electrons. This 
layer is known as the ionosphere. Thus, the region in which an elec- 
tromagnetic wave travels may be roughly pictured as a dielectric 
pounded by two conducting layers. 

Ionisation of the atmosphere is not uniform, i.e., the number of 
free charges per unit volume varies from one layer to the next. As 
was seen in Sec. 125, the coefficient of refraction decreases as the 
number of charges increases. Since the coefficient of refraction of 
a conducting medium is less than unity, a wave entering the iono- 
sphere at an angle from a dielectric medium is deflected from the 
normal. The ionisation increases; hence, the deflection increases 
with each succeeding layer. 

Furthermore, as Fig. 139 shows, a wave may either pass through 
the ionosphere and recede from the Earth or, after being bent more 
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and more, return to the Earth. Roughly speaking—disregarding 
the nonuniformity of the ionosphere—a wave returns to the Earth 
if it strikes the ionosphere at an angle greater than the total internal 


reflection angle: 
P 4nN2e 
sin 0) >n= V1 ae eal 


For smaller angles, the wave is propagated into outer space. By 
repeatedly being reflected from the ionosphere and the terrestrial 
surface, short waves can round the globe, experiencing considerably 
less energy losses than in the case of long waves. 

Since uhf waves can pass through a layer of free charges, they are 
not reflected from the ionosphere. Therefore, radio reception on uhf 
is possible only along the line of sight. 

The above picture of the atmosphere is greatly oversimplified. 
Investigations have shown that the density distribution of free 
electric charges in the atmosphere is characterised by several maxi- 
ma since the ionosphere is composed of several layers. The stability 
of these layers differs, depending on the time of year. It is interesting 
that the existence of the layers is related to solar activity. Thus, 
variations in the state of the ionosphere corresponding to the 14-year 
sun-spot cycle may be observed. Ionisation of the upper layers of the 
atmosphere is undoubtedly related to the arrival of cosmic radiation 
on the Earth. 

From a study of the electrical properties of the ionosphere and. 
the surface of the Earth, radio engineers have drawn a number of 
conclusions regarding the most favourable conditions for radio 
transmission and reception on waves of various lengths. However, 
we shall not go into this subject. 


130. Radar 


A radar station consists of transmitting and receiving equipment. 
Every ten-thousandth of a second (A in Fig. 140), the transmitter 


Fig. 140 


sends a pulse of duration a (of the order of several microseconds 
into space. If.an object capable of reflecting the wave is intercepted 
in the solid angle “illuminated” by the radio waves, a part of the 
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wave is reflected back to the radar station. The reflected signal is 
received talk sec after the pulse is transmitted into space. 


This time may be measured by means of an oscilloscope. The 
sweep of the electron beam is synchronised with the transmitted 
pulses, and the demodulated signal from the receiver is fed to the 
second pair of oscilloscope plates. As a result, a “pip” displaced 
relative to the initial point of the sweep by a distance proportional to 
the time t appears on the oscilloscope screen. If the object intercepted 
by the radar pulse is stationary, the “pip” on the oscilloscope screen 
will also be stationary. Thus, synchronisation is achieved when the 
sweep time equals one ten-thousandth of a second, the interval of 
time between successive transmitted pulses. If the object “seen” by 
the radar is moving, then the pip on the oscilloscope screen also moves. 

Modern radar systems are much more complex than indicated by 
this simplified picture. The motion of the electron beam of an oscil- 
loscope from the centre to the edge of the screen is more complex 
than motion along a radius. While moving outward along the radius, 
the electron beam slowly rotates about the centre of the screen like 
the hand of a clock. This rotation is synchronised with the rotation 
of the radar antenna in such a manner that the illuminated line 
points in the same direction as the transmitted radio beam. In addi- 
tion, the following important change in the operation of the oscillo- 
scope is introduced: If the radio beam does not encounter an obsta- 
cle, and hence the receiver does not pick up a reflected signal, the 
oscilloscope screen remains dark. On the other hand, if a pulse is 
received, a spot on the screen is illuminated. 

Thus, when a beam scans the horizon and encounters a body, this 
body is indicated on the oscilloscope screen by an illuminated spot. 
The distance of this spot from the centre of the screen is proportional 
to the distance of the radar from the object, and the azimuthal angle 
indicates the direction of the object. 

Oscilloscope screens possess an afterglow; hence, an illuminated 
spot does not disappear while the radar is scanning the area, i.e., 
before returning to the same position. If the illuminated spot is due 
to the beam reflected from a fixed object, the image on the oscillo- 
scope screen is also fixed. If the object moves, a moving image will 
appear on the screen. 

Due to the difference in the reflection coefficients of various ob- 
jects, a characteristic picture of the region is depicted on the screen 
of a radar system with circular scanning. Rivers and lakes appear 
dark (little reflection), the Earth appears lighter and woods still 
lighter. Of course, metal objects are “seen” very clearly. 

The nature of the visibility varies in accordance with the wave- 
lengths used. Thus, for radio waves in the centimetre range, clouds 
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are seen very clearly. Since longer waves are insensitive to clouds 
and rain, radar systems operating on such wavelengths can be used 
in all kinds of weather if they are not intended for the specific pur- 
pose of detecting clouds. 

Radar principles find broad application in science and engineering. 
Thanks to radar, pilots experience no difficulty in conducting night 
flights and in landing on airports which are not illuminated. Radar 
is of great importance in meteorology. In addition to enabling us to 
detect rain and storm clouds at great distances or at night, which 
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is essential for weather-forecasting, radar can be used to track mete- 
orological balloons. Radar equipment installed on ships is a great 
aid to navigation safety, reducing to nil the possibility of accidental 
collisions of a vessel with other vessels or obstacles. In the field 
of astronomy, radar methods are used to determine the distance to 
meteors, and the direction and velocity of their flight. The waves are 
reflected, in the main, from the meteorite “trails”, which consist of 
ionised gases. The Moon, the Sun and the planets are all within the 
reach of radar. Radar astronomy is of great practical importance. 
It has enabled us to develop navigational instruments to determine 
the location of a ship, from observations made on heavenly bodies, 
in any kind of weather and at any time of day or night. 

A vast amount of the literature is devoted to radar problems. 
Since such problems rightly belong to the field of radio engineering 
and not physics, we have restricted ourselves to an elaboration of 
the principles of this remarkable development. 

Fig. 141 shows a block diagram of a radar system. 


CHAPTER XX 


INTERFERENCE PHENOMENA 


131. Addition of Waves from Two Sources 


First, let us consider two ideal sources radiating spherical waves. 
Assume both sources are oscillating synchronously. In this case, for 
waves of any type, a characteristic field is created in which bright 
and dark “fringes” appear where the waves reinforce and cancel each 

‘other, respectively. This phenomenon is most 
easily demonstrated by means of water waves. 

The mathematical calculations are straight- 
forward. Take any point a distance r,; from . 
one wave source and rz, from another. Then, 
maximum reinforcement of the waves occurs 
when the path difference 7; — rz equals a whole 
number of wavelengths, nd. On the other hand, 
the waves annul each other when the path 
difference equals an odd number of half- 


wavelengths (27-+ 1) = ; 


We know from analytical geometry that a 
curved surface all of whose points satisfy the 
following condition is a hyperboloid: the 
difference between the distances to two foci is Fig. 142 
a constant. In Fig. 59 (p. 123) a plane is drawn 
through the wave sources. Shown in this plane are hyperbolas— 
_loci for which the difference between the distances to the wave 
sources is a constant. 

Now, let us consider the pattern obtained in a wave field on 
a cylindrical screen whose axis passes through the radiators as shown 
in Fig. 142. (We shall assume throughout that the radiators are 
point sources.) This interference pattern will consist of alternating 
bright and dark horizontal lines since the conditions are exactly the 
same for all points on the cylinder located at the same height, i.e., 
relative to the radiation sources all such points are in the same state. 
A bright band will appear along a line around the middle of the 
cylinder; since the distance from both sources is the same, the waves 
reinforce each other. For points at a height z above the mid-line, 
the difference in the paths traversed by the rays, ry; — rz, will be 
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represented by “——=. But rj—r,=2lz, where J is the distance’ 


between the sources. Thus, the condition for the n-th bright fringe 
has the form 


rye 
If the radiators are far from the screen, then for fringes close to 
the centre, 


Try +1, ~ 2R, 
where R is the radius of the cylinder. The bright bands pass through 
the points z satisfying the condition 


zZ 
Rr =n. 


The distance between adjacent fringes is Az = 

Example. If two coherent sources (see Sec. 132) ‘separated by a distance 
1 = 1 mm radiate light of wavelength A= 6,000 A, then the distance between 
interference fringes on the surface of a cylinder of radius R = 41 metre is Az = 


If the light source radiates waves of various wavelengths, the 


interference pattern will be coloured since the maximum conditions 
differ for different values of i. 


In addition to determining the positions of maximum and mini- 
mum interference, it is also of interest to determine the form of the 


F intensity curve across‘ the fringes. 
Since a is the path difference 
between the waves, then 


is the phase difference, and the 
total amplitude at any point is 
given by 

Acos wt + A cos (wi + §). 
For equal amplitudes, we obtain 
the expression derived on p. 103: 


6 6 
2Acos—cos (wi+— =]. 
Fig. 143 2 ( ape ) 


The measured intensity (wave 
amplitude squared) is equal to the average value of this expres- 
sion taken over the oscillation period. 


Since , 
[ cos? (we ab 4) Jew aie 
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(see the following article regarding calculation of the average), then 


— 9 42 eo? 
IT=2-A? cos Wik 
The intensity curve may be plotted as a function of the vertical 
coordinate z (Fig. 143). 


132. Coherence 


The superposition of two waves, described in the preceding section, 
may be physically achieved by various means and for various wave- 
lengths. For example, two antennas radiating radio waves may be 
placed close together, two electric bulbs with point filaments may be 
placed close together, or an incident ray and the same ray reflected 
from a mirror may be brought together. Experiments show that by no 
means does interference occur in all cases. This phenomenon can best 
be investigated by examining the superposition of the fields of two 
antennas. It is easily shown that an interference pattern is obtained 
only when a phase difference that remains constant during the time 
of observation exists between the superimposed waves. In this case 
the oscillations are said to be coherent. 

If the phase difference is fixed, the amplitude of the electromagne- 
tic oscillations at a given point in space is constant. Thus, a maxi- 
mum point remains a maximum and a point where the waves comple- 
tely cancel each other always has zero intensity. 

When the phase difference varies randomly, the pattern is comple- 
tely different. During a certain interval of time the amplitude of 
the oscillations at a given point is a maximum, in the succeeding 
interval it assumes intermediate values, and then for an interval of 
time the waves cancel each other. If the duration of these intervals 
were commensurable with the practical capabilities of instruments, 
a fluctuating interference pattern could be detected. If the variations 
in phase difference are so rapid as to preclude detection by these 
instruments, the interference pattern is not revealed and the average 
value of the intensity is shown on the instruments. In such cases, 
we say that the oscillations are noncoherent. 

What is the expression for the average intensity in a region where 
fields are superimposed? This is easily determined. 

The amplitude of the total wave at a given point and at a given 
instant may be expressed in the form 


A; cos wi + A, cos (wi + 8). 


The instantaneous intensity is proportional to this expression 
squared, i.e., it is equal to 


A} cos? wt + A cos? (wt + 6) + 2.4; de cos wt X cos (wi + 6). 
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We are interested in the time-averaged intensity of the radiation, 
i.e., 


I = Aj (cos? @t)qv-+ Ag [cos? (wt +8) ]av-+2Ay Ao [Cos wt X cos (wt+-5)]ar- 


The average values of trigonometric quantities are encountered quite often 
in physics. It is therefore useful to recall that the average values of sin x and 
cos x are equal to zero, and the average values of sin2z and cos?z are equal 
to 1/2, if the argument z of the trigonometric function assumes all values with 
equal probability. The average value of a function { (x) is by definition 


Uf (Map LE EL edt Hi Cen) 


This formula can he used to calculate the average if the variable z assumes 
discrete values. But if the variable x is continuous and assumes all values in 
the interval from a to b, the formula for calculating the average value is ob- 
tained in the following manner. Divide the interval (b — a) into n segments of 
length Az. Multiplying the numerator and denominator by Az, we obtain 


— f(x) Azr+f (a) Az+ ... 
(f Qa a , 
Going over to the limit, this takes the form 


b 
U @lao= Go J 1) az. 


By means of this formula, we can calculate the average value of any func- 
tion of a continuously varying random quantity. In calculating the average 
value of a periodic function, a single period should be used in determining the 


limits of integration, for the average value of one period is clearly equal to 
the average value of any number of periods. Thus, for example, 


t 
[cos? z]qy= a \ cos? x dz =: + : 
0 
Let us write the formula for the intensity in the form 
IT = Aj (cos? wt)gy + Aj [cos? (wd + 8)]ay + Ay Ap [cos (2a¢ + 6) Jan + 
+ Aj,Ap (cos S)ay. 


Using our knowledge regarding the average values of cos x and cos? 2, 
we obtain: for a phase difference between two waves varying ran- 
domly, i.e., for noncoherent oscillations, 


T= 5 (dt + AD; 


on the other hand, if the phase difference is fixed, i.e., if the oscilla- 
tions are coherent, 


I= (At + 42) + AyAz cos 6 
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or, for equal amplitudes, 
) 


T=2A4° cos* > . 
This last formula is the same as the interference formula derived in 
the preceding article. 

Radio waves radiated by neighbouring antennas may be made 
coherent or noncoherent by technical means. 

As regards oscillations of light waves, at first glance it would seem 
that it is impossible to achieve coherence since the radiations of 
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individual atoms have no definite relationship to one another. The 
phases of the waves emitted by individual atoms are haphazardly 
distributed. It is quite natural that two light sources, no matter 
how closely they approximate point sources, do not yield an inter- 
ference pattern. Nevertheless, coherent light oscillations do exist. 
They occur for rays “taken” from one and the same light wave. 
Methods of artificially securing coherent sources are shown in 
Fig. 144. In one case two mirrors that are slightly inclined relative 
to each other are used and in the other a double prism (biprism) is 
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used to produce wave sources from two virtual centres. Interference 
fringes may be observed on a screen placed anywhere in the inter- 
ference field. The theoretical discussion presented above is entirely 
applicable to these cases; the nature of the fringes is determined 
by the distance between the virtual images of the light source and 
the distances from these images to the point of observation. 
It is quite clear why the two parts of the “divided” ray are coher- 
ent. Between any pair of atoms of a real source, there is no coherent 
relationship. However, by divid- 
"\, 2 ’, ing such a ray into two parts, 
we enable the radiation of each 
atom to interfere with itself. 
Thus, consideration of various 
cases of lightjinterference is re- 
duced to the investigation of va- 
rious cases of division of alight ray 
by reflection and refraction, and 
the successive superposition of the 
components of the divided wave in 
the region of the interference field. 
The size of the light source 
significantly affects the coherence 
of the “divided” ray. Let us assume 
that the light source is of length b 
Fig. 145 and that rays in the solid angle 
2u take part in the creation of the 
interference field (Fig. 145). Rays such as J and J’ emanate from 
a single atom and are therefore coherent. The same is true of 2 and 
2". Consider the interference due to the superposition of the fields 
of rays J, 2 and 1’, 2’. For interference to occur, the fields of the 
coherent beams J, 1’ and 2,.2’ must reinforce each other. The path 
difference A = b sin u, which exists between 7’ and 2’ as well as 
between J and 2, tends to prevent this. Moreover, interference be- 


comes possible only when b sin ho 


133. Interference in a Plate 


Let us investigate reflection and refraction of light incident on 
a flat plate of thickness d (Fig. 146). 

Assume a plane wave impinges on the plate at an angle i. The 
beam of light is reflected and refracted. Moreover, the refracted 
beam strikes the lower surface of the plate and is also reflected and 
refracted. As a result, there arise numerous rays parallel to the pri- 
mary reflected beam, and also numerous parallel rays transmitted 
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in the second medium. All these rays are coherent and a phase differ- 
ence exists between them. Hence, the conditions are present for 
interference in the reflected as well as in the transmitted rays. 

As is well known (see p. 336), the reflection coefficient is not very 
large—at least, for normal incidence. In this case, the intensity of 
each “succeeding” ray is much less than the aMtenstty of the preceding 
one. For example, for a reflection coefficient of 5%, the first reflected 

ray has an intensity of 

0.05 J). The second rellect- 
ed ray undergoes two re- 
fractions and one reflection. 
Its intensily is 0.95x0.95 x 
x 0.0575 = 0.04579. Thus, 
the intensities of the first 
lwo rays are practically the 
same. But the third ray is 
much weaker since it un- 
dergoes three reflections and 
two refractions. Itsintensity 
is equal to 0.95 x 0.95 x 
x 0.05 x 0.05 x 0.05 Jo, 
i.e., one fouwr-hundredth of the intensity of the preceding ray. 

Under conditions of small reflection coefficient, the phenomenon 
reduces to the observation of the interference of the first two rays. 

As regards the transmitted rays, under conditions of small reflec- 
tion coefficient the interference is not noticeable since the intensity 
of the second ray is one four-hundredth of the first, the intensity of 
the third is one four-houndredth of the second, etc. However, it is 
not very difficult to set up the experiment in such a manner that 
in the reflected as well as in the transmitted beam numerous inter- 
ference rays occur. 

If a monochromatic wave impinges on a flat plate, the interference 
pattern is determined by the phase difference between the first and 
second reflected rays. 

From the wave formula . 


-1 cos @ (c—=) ‘i 


it is evident that the phase of the wave, traversing the path zx with 
velocity v, changes by o-, or x, where Vis the wavelength in the 
medium. Designating the wavelength in vacuum by Ao and recalling 
that the refractive index 7 is equal to fo , the change in phase may 


2st 


be written in the form Ty ht The product nz is often called the opti- 


23-1409 
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cal path of the wave. If a wave passes through several media, its 

phase changes by = sg, where S = (myx, + mote +... .) is the opti- 
0 

cal path. ; 

The phase difference 6 between the interfering waves, which deter- 

mines the intensity of the resultant wave, is given by 

b= A, 
i.e., it is determined by the difference between S’ and S”, the optical 
paths of these waves: A = §’ — S". . 

Referring to Fig. 146, let us calculate A for the case which interests 
us. It is most convenient to express A in terms of the refraction angle 
r, the plate thickness d and the index of refraction n. As seen from 
the figure, 


A=2dncosr. 


However, in addition, it is necessary to take into account the phase 
jump occurring upon reflection (cf. pp. 335-36). In this respect, the 
first and second rays differ, for the first is reflected from the external 
surface of the plate, while the second is reflected from the internal 
surface. Therefore, the electric vector of one of the rays undergoes 


a 180° phase jump and the other does not. Thus, the resultant phase 
difference is 


6 = 2dncosr + x. 
0 


Maximum interference occurs when 6 = m2x, where m is a whole 
number; minimum interference occurs when 6 = mn. Therefore, 
3 eas P Ao - 
maximum condition: 2dncosr= may Sra 
minimum condition: 2dncosr = mAb. 


Thus, depending on the value of A, n, d and r, interference may 
cause the intensity of a wave reflected from a plate to be zero or 
a maximum. In an ideal experiment with a monochromatic beam, 
by varying the angle of incidence, for example, a reflected ray 
should alternately vanish and reappear. In an analogous experi- 
ment with a beam of white light, the plales should pass through all 
the colours of the rainbow in succession. 


134. Fringes Representing Equal Thickness and 
Fringes Representing Equal Inclination 


Several factors enter into the extremum condition 2dn cos r = mA. 
Hence, if they are varied simultaneously, a confused picture may 
result. The effect is clearest when all the parameters, except one, 
may be considered fixed. 
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If a plate has a variable thickness d, a constant refractive index 
and a practically constant angle of incidence (and hence angle of 
refraction) for the portion of the plate under consideration, the inter- 
ference will be observed in the form of fringes representing equal 
thickness. All parts of the plate having the same thickness d will be 
subject to the same conditions. Therefore, on an uneven plate, there 
appears a system of bright and 
dark fringes (or rainbowed in case 
of white light). These lines con- 
nect points where the thickness 
of the plate is the same. This 
explains the coloured fringes often 
seen on oily films spread on the 
surface of water. If a plate is 
wedge-shaped the fringes repre- 
senting equal thickness consist 
of straight lines. Such fringes Fig. 147 
may be easily observed on soap 
films. In the case of a vertical film, the soap trickles down and 
the film becomes thinner in the upper region; horizontal fringes 
appear on the film. 

When light impinges normally on a plate, cosr ~ 4 and fringes 
appear on the plate where the thickness d satisfies the relation 
2dn = mag. 

The difference in the thicknesses of the plate represented by adja- 


cent fringes equals fo a i.e., a half-wavelength. Thus, bright 


fringes representing equal thickness indicate nonuniformities in 
plate thickness of the order of a tenth of a micron. 

If the thickness from one point to another varies very slowly, the 
fringes may lurn oul to be very far apart. Thus, for example, in 
a dripping soap film a wedge may form having a 0.5-minute angle; 
in this case, as can be easily calculated with the aid of Fig. 147, the 
fringes will be 2 mm apart. 

If a wedge decreases lo zero thickness, the end of the wedge ap- 


pears dark in the reflected light since thicknesses less than 4 do not 


nj 


reflect light. The first bright fringe occurs for the thickness a= * 


(the path difference is twice as great since the return path must also 
be included in the calculation). The next bright fringe occurs for 
d=i, etc. Thus, the thickness may be determined by simply count- 
ing the fringes. 

The question naturally arises: Why are fringes representing equal 
thickness easily observed on thin films but not, for example, on a 
windowpane? The answer is that it is not possible to create the 


23* 
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ideal conditions whereby the single variable quantily is the plate 
thickness d. 

Let us consider the effect of a spread in the angle of incidence 
(refraction). If the angles vary from r; to rz in such a manner that 
on the interference maximum for r, there is superimposed the extinc- 
tion for rz, the interference fringes will be smeared. What is the 
value of the angular interval Ar = r, — r, that smears the pattern 
of fringes representing equal thickness? The value can be determined 
from the conditions: 


bf 


2dncosr;=ma and 2dncos rp = (m -- 5) nN 

whence 
2dn (cos r,— cos ry) = 4 ; 
For simplicity, let us restrict ourselves to the case of normal inci- 
dence; let 7; equal zero and rz a small value Ar. Then, 
a _ (Ar2y 1 

2dn=md and 2dn (4 <T) = (m +5) A. 

Hence, 


ety 


2dn a 


2 


4 


and since 2dn = mA, 


> 


1 


m * 


(Ar)? = 


If the plate is thin, the values of m are measured in units or tens of 
units. In this case, an angular spread of about a tenth of a radian, 
i.e., 5°-10°, does not smear the pattern. However, for a plate 1 mm 
thick, m is already of the order of 5,000. Here, an angular spread 
of the order of only a hundredth of a radian suffices to prevent obser- 
vation of fringes representing equal thickness. 

However, even if the geometry is ideal, relatively thick plates do 
not yield an interference pattern. This is due to the fact that real 
light is not ideally monochromatic. An ideal wave is an infinite 
sinusoid. But an atom radiates for a very short period of time. 
Experiments show that a real wave is a sinusoidal wave train whose 
length is of the order of ten to a hundred millimetres. In order for 
interference to occur, the path difference must not exceed the length 
of the sinusoidal wave train; otherwise, coherence does not exist 
between the superimposed waves. If the length of the sinusoidal 
waye train is equal to 100 mm, the path difference corresponds to 
a value of m of the order of 150,000. 
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Now, let us consider fringes of another kind, namely, fringes 
representing equal inclination. Such fringes may be observed when 
a beam of light with a conlinuous spectrum of incident angles im- 
pinges on a plate having parallel surfaces, i.e., a plate for which 
d is the same at all points (Fig. 148). 

Consider a beam of reflected rays contained in a given solid angle. 
Let us direct our atlention to those rays lying along generating lines 
of a cone whose axis is normal to the plate. All rays lying on such 
a cone have the same value of r and yield lines representing equal 
inclination. 

The differences in the method of observation of lines representing 
equal thickness and lines representing equal inclination should be 


Fig. 148 


noted. Since lines representing equal inclination occur at infinity, 
a lens must be placed in the path of the rays to make them visible. 
This enables us to observe curves representing equal inclination in 
the focal plane of the lens. For normal incidence, lines representing 
equal thickness may be observed with the naked eye on the surface 
of a wedge. If the light impinges on such a plate at an angle, lines 
representing equal thickness may be observed on the surface of 
a wedge only in the case of very thin films. Otherwise, the interfer- 
ence pattern is observed in two planes located above and below 
the wedge at a distance q= 
this formula — which we leave to the reader—plot a ray incident on 
the surface of the wedge at an angle i and the rays reflected from the 


, where a is the wedge angle. To derive 
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upper and lower surfaces. The observation plane of the interference 
pattern will pass through the point where the prolongations of the 
two reflected rays intersect. Note that the above formula is only 
valid for the case of an air wedge. 


135. Practical Applications of Interference 


Interference methods are widely used for the measurement of 
small distances and small changes in distances. They enable us to 
detect thickness changes of less than one-hundredth the wavelength 
of light. An accuracy of 10-? cm may be achieved in the measure- 
ment of the unevenness of a crystal surface by interference methods. 


Fig. 149 


Many applications are based on the use of curves representing 
equal thickness. This method is widely used in the optical industry. 
For example, in order to check the quality of the surface of a glass 
plate, an air wedge is created between the plate under test and a 
standard plate having an ideal flat surface and the fringes repre- 
senting equal thickness are examined. The air wedge is formed by 
pressingsthe two plates together along one edge. If both surfaces 
are flat, the lines representing. equal thickness will be parallel 
straight lines. 

Let us assume that the surface of the plate under test has a depres- 
sion or a bump. The lines representing equal thickness will then be 
distorted, i.e., they will bend around the defective area. If the angle 
of incidence of the light is varied, the fringes are displaced in one 
direction or the other, depending on whether the defect is a depres- 
sion or a bump. The patterns seen under a microscope in these cases 
are shown in Fig. 149. The first two pictures are those of defective 
samples. In the first the defect is located on the far right, while in 
the second the defect is on the left. The third picture is that of a 
sample without defects. 
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This method may also be used for very accurate measurement of 
the coefficient of expansion. For this purpose, an air gap must be 
created between the surface of the object under test and a perfectly 
flat surface. As the object expands, the thickness of the air layer 
changes and the fringes representing equal thickness begin to move. 
If a line is displaced to such an extent that the next one takes its 
place, the thickness of the air layer at this location has changed 


by ue If, as is usually the case, the measurement is performed using 


monochromatic light, the fringes are very sharply delineated and the 
displacement of a line by 
one-hundredth of the dis- 
tance between lines may 
be measured. 

Accurate measurement 
of the refractive index of 
a substance may be per- 
formed by means_ of an 
interference refractome- 
ter. In such an_ instru- 
ment, the interference 
between two light rays 
that are separated as 
much as possible is ob- 3 
served (Fig. 150). For this 
purpose, a thick plate is 
used and a_ convenient 
angle of incidence selected (for ordinary glass, the best angle is 
about 50°). The rays travelling between the plates are separated 
and the substance being tested is placed in the path of one of them. 
This changes the optical path of this ray and hence the path differ- 
ence between the intérfering rays. If the interferometer plates are 
exactly alike and perfectly parallel, the interfering rays cover the 
same distance and reinforce each other. If the plates are inclined 
with respect to each other, a path difference is created and the clar- 
ity of the field being observed is reduced. 

Such is the situation for a beam of perfectly parallel rays. But if 
a slightly divergent beam impinges on the plate, a system of fringes 
representing equal inclination is seen through the eyepiece. In this 
case, the variations in the optical path difference are conveniently 
determined by counting the interference fringes passing the cross 
hair of the instrument. 

Let us assume that a body of length 7 and refractive index n is 
placed in the path of one of the rays. If the refractive index of the 
medium is 79, the optical path difference changes by A = 1 (n — ny). 


Ut) Le 


Fig. 150 
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Therefore, * fringes should pass through the eyepiece of the instru- 


ment. The accuracy of this method may be easily gauged from the 
fact that a displacement of one-tenth of the distance between lincs 
is easily detected. For such a displacement, A = 0.14 = 0.5 X 
 10-° cm, which for a length 2 = 10 cm enables us to distinguish 
a change of 0.5 x 10-® in the refractive index. 

The well-known Michelson interferometer (Fig. 151) is used for 

the accurate measurement of length as well as for the determination 
of the velocity of light (see 
p. 417). In this instrument, 
a parallel beam of mono- 
chromatic light impinges 
on a glass plate having pa- 
\ rallel surfaces, one of which 
(> © is covered with a_ trans- 
i lucent layer of silver. This 
plate is placed at a 45° angle 
to the incident beam. As a 
result, the beam is divided 
ake into two parts. One part 
moves parallel to the pro- ° 
longation of the incident 
beam and the other is di- 
rected perpendicular to the incident beam (to the left). These 
rays impinge normally on two mirrors and return to the same points 
on the translucent plate from which they came. Each ray returning 
from the mirror is repeatedly divided at the plate. Part of the light 
returns to the source and the other part enters the telescope to the 
right. As a result, two coherent interfering rays appear in the field 
of the telescope. It is seen from the figure that after the first division 
at the lightly silvered surface the ray coming from the mirror opposi- 
te the telescope passes through the half-silvered plate twice. There- 
fore, in order to provide equal optical paths, the ray coming from 
the other mirror is passed through an equalising plate identical 
to the first plate, but without the translucent layer of silver. 

In the field of the telescope there appear lines representing equal 
inclination (rings), corresponding to interference in an air plate whose 
thickness is the difference in the distances of the mirrors from the 
translucent layer. The displacement of one of the mirrors by a quar- 
ter of a wavelength corresponds to the transition from a maximum to 
a minimum, i.e., it results in the displacement of the pattern by 
“half a ring”. Such a change may be easily detected by an observer. 
Thus, the sensitivity of an interferometer using rays of violet light 
is better than 1,000 A, i.e., 0.1 micron. 


Fig. 151 
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An ‘interesting application of the Michelson interferometer prin- 
ciple is the microinterferometer developed by the Soviet physicist 
V. P. Linnik. In this instrument, one of the interferometer mirrors 
is replaced by an object to be investigated. Lines representing equal 
thickness are observed in the image plane. Dimension details of 
0.4 micron yield a sharp transition from maximum to minimum 
illumination. A microinterferometer is generally constructed in the 
form of an adapter toa usual microscope and screws into the drawtube 
in place of the objective. 
Of great importance in sci- 
ence are interference mi- 
croscopes, which in principle 
are similar to the micro- 
interferometer. (In the mi- 
croscope, interference does 
not occur in the image 
plane, but occurs rather in 
front of the usual objective, 
i.e., the interference image 
has the same dimensions as 
the microobject.) A double- 
ray interference microscope 
does not yield a gain in 
magnification. The advan- 
tage of this method lies else- 
where. For example, we 
often encounter objects of 
microscopic investigation Fig. 152 
that are either entirely 
transparent or vary little in transparency from one point to another. 
Before this method was developed, the details of such objects could 
be made visible only by dyeing (generally speaking, different struc- 
tural elements absorb dye differently). But dyeing has little appli- 
cation in the investigation of living microorganisms. Fig. 152, which 
shows a microphotograph of frog’s blood magnified 300 times, illus- 
trates the possibilities of the interference microscope. The drawbacks 
of this method include considerable loss of light in the interference 
mechanism aid complexity of the microscope’s optical system. 
A further increase in the sensitivity of interference methods and, 
therefore, a further advance in the field of interference microscopy is 
possible by going over from a double-ray interferometer to a multi- 
ray interferometer. In double-ray interferometers, the screen illu- 
mination is proportional to 1 + cos kh, where h is the displacement 
along the screen. Due to the smoothness of the transition from maxi- 
mum to minimum illumination, a small displacement of the inter- 
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ference fringes is difficult to detect. In multi-ray interferometers, 
the situation is considerably improved. As an example, let us 
consider the Fabry and Perot interferometer. A Fabry and Perot 
interferometer (Fig. 153) consists of two rather thick glass or quartz 
plates partially silvered on their adjacent sides. These surfaces are 
perfectly parallel to each other and the region between the plates 
contains air. When a beam of light enters the interferometer, the 
beam is divided into a transmitted and reflected component every time it 
impinges on one of the layers 
of silver. Thus, in the trans- 
mitted as well as in the 
reflected light, there is ob- 
tained a set of coherent 
light beams whose intensi- 
ties decrease in geometric 
progression (see p. 352) and 
whose phases are displaced 
Fig. 153 in arithmetic progression. 
In order to secure interfer- 
ence of a large number of rays, the decrease in amplitude during suc- 
cessive reflections must not be large. This condition is met when the 
silver coating on the plates has a reflection coefficient of 0.9 or more. 
The intensities of the rays in the transmitted light are then quite 
low, but there will be little variation from ray to ray. This makes it 
possible for a large number of rays (as many as 10-15) to take part 
in the formation of each illumination maximum. 

he interference pattern is obtained in the form of the usual rings 
representing equal inclination, but with one very important differ- 
ence, namely, the principal maxima determined by the condition 
2dn cos @ = mid are now much narrower and their intensities are 
tens of times greater than the intensity of the background between 
them. Therefore, the interference pattern assumes the form of very 
narrow bright fringes separated by broad dark intervals. The dis- 
placement of such a narrow maximum may be determined much more 
accurately than the displacement of a fringe in a double-ray inter- 
ferometer. An analogous narrowing of maxima occurs when the 
number of slits in a diffraction grating is increased. 

Thus, multi-ray instruments sharply increase the sensitivity of 
interference methods. Such systems are indispensable in the investi- 
gation of the vertical structure of an object surface in reflected light. 
The magnification of details in the vertical direction may reach 
a value of 400,000, which makes it possible to reliably resolve details 
of the order of 5-10 A. This -is only several times greater than the 
distance between atoms! An example of such photography is the 
picture of the spiral growth of a crystal shown on p. 640. 


CHAPTER XXI 
SCATTERING 


136. Secondary Radiation 


Under the action of an electromagnetic wave, every molecule 
becomes a secondary radiator of electromagnetic waves. Due to the 
electric force, the electron cloud is displaced relative to the atomic 
nuclei and the molecule acquires a dipole moment varying in time as 
the frequency of the incident wave. The behaviour of such a mole- 
cule differs in no way from the behaviour of the elementary dipole 
discussed in Chapter XX. The intensity of the secondary wave is 
given by the formula derived on p. 326 (intensity ~ ss sin? 0) and 
the spatial intensity distribution of the secondary radiation is shown 
in Fig. 133. 

In a number of cases, which will be discussed below, the phenome- 
non of secondary radiation leads to various phenomena of electro- 
magnetic wave scattering. By scattering, incidentally, we generally 
mean any electromagnetic wave propagation phenomenon that is 
not included under refraction, reflection and rectilinear propagation. 

The intensity formula given above is valid for any electromagnetic 
wave. However, the fact that the intensity increases sharply with 
radiation frequency explains why the effects of wave scattering by 
a molecule are not detectable when the wavelengths are very long. 
The scattering intensity of visible light is quite sufficient to produce 
significant effects. 

Light wavelengths are hundreds and thousands of times greater 
than the dimensions of ordinary molecules. Therefore, all the elec- 
trons of a molecule are made to vibrate in the same phase by the 
external field. For light waves, ultraviolet rays and even very soft 
X-rays (i.c., X-rays of long wavelength), a molecule behaves like 
an elementary electric dipole. 

The picture changes considerably in the case of X-rays having 
a wavelength of the order of 1 A. Now, the dimensions of the mole- 
cule are larger than a wavelength and different portions of the mole- 
cule’s electron cloud vibrate in different phases. In order to deter- 
mine the intensity of the scattered wave, we must take into account 
interference effects occurring between waves scattered by different 
parts of the molecule. 

In principle, this is not very difficult. First, it is necessary to di- 
vide the molecule’s electron cloud into small volumes, The dimensions 
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of each such volume, Av;,, must be much less than a wavelength. 
Then, the electrons in this volume will scatter in the same phase. 
Designating the density of the electron cloud by p, we obtain pAr;, 
electrons in a volume Av,. The amplitude of the secondary wave 
created by the k-th: volume is proportional-to pAv,. The amplitudes 
of the scattered waves are added with due regard to the phase dilfer- 
ences between the elementary waves and this sum is then squared. 
It is found that the intensity distribution of the scattered wave 
differs significantly from the radiation pattern of a single dipole. 
This is understandable, of course, for there will be directions in 
which elementary waves scattered by different volume elements 
reinforce each other, i.e., act in phase, and, on the other hand. 
directions in which elementary waves tend to annul each other. An 
important conclusion to be drawn from such calculations is the fol- 
lowing: When interference occurs between elementary waves ema- 
nating from different volume elements of a particle, waves travelling 
backwards tend to annul each other in the final analysis; on the 
other hand, those travelling forward reinforce each other. 

We have been discussing secondary radiation from a molecule, 
but often the secondary radiator is a much bigger particle, namely, 
one composed of numerous molecules. It may be a particle of dust, 
colloidal substance, fog, crystalline substance, smoke, or a large 
albuminous molecule, etc. The nature of wave scattering by particles 
is determined by the ratio of their dimensions to the wavelengths 
of the exciting electromagnetic wave. If the particle is small relalive 
to the wavelength, the wave is scallered as by a single elementary 
dipole. If this is not the case, interference effects occur and the for- 
ward scattering predominates. 

Different parts of a particle may possess different scattering powers. 
This is precisely the situation in the case of X-rays scattered by 
a molecule. A particle whose scattering power is the same throughout 
the volume is the simplest body from the standpoint of scattering 
investigations. We shall confine our attention to such a syslem, 
for not only are the calculations simple in this case, but, in addi- 
tion, a system of this kind is easily simulated experimentally by an 
aperture in an opaque screen. 


137. Wave Diffraction at Apertures 


The amplitude of a wave scattered by a particle is determined by 
the distribution of scattering material in the particle. Particles 
(“apertures”) may be encountered in which the density of the scatter- 
ing material gradually decreases with increasing distance from the 
centre of the atom. On the other hand, more pronounced nonuni- 
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formities may be encountered, e.g., inclusions and pores at whose 
edges the density changes abruptly. 

Peculiar diffraction effects arise when scattering takes place on 
such nonuniformities. With increasing angle the scattering intensity 
lirst gradually decreases to zero. Then, as the angle increases further, 
the inlensily increases to a 
Maximum value, whereupon 
it again decreases to zero. 
Subsequently, the wave-like 
nature of Lhe curve continues 
with decreasing amplitude. 
Scallering on such objects 
leads to the formation of 
diffraction bands and spots 
of various shape depending 
on the nature of the scat- 
lering object. 

The most pronounced dif- 
fraction effects are observed 
for scattering at apertures 
made in an opaque screen. 
Every aperture may be 
viewed asa region uniformly 
filled with radiating dipoles. 
The scattering patterns of 
an aperture and a particle 
having the shape of such 
an aperture should yield 
identical curves of intensily 
vs. scattering angle. 

For light rays, diffraction 
patterns are best observed 
using parallel rays in accord- 
ance with the following 
scheme. The rays of a light 
beam emanating from a 
source are made parallel and allowed to impinge on a screen in which 
various inclusions (if the screen is transparent) or apertures (if the 
screen is opaque) are located. A lens placed behind the screen brings the 
Parallel rays into the plane of a photographic plate or screen for the 
observation of the effect. If there are no nonuniformities, apertures, 
etc., in the path of the rays, the lens gathers the rays in a point. 
Otherwise, a scattering or diffraction pattern appears on the screen, 

Fig. 154 shows the diffraction patterns obtained in this manner 
from (a) two needles and a thin wire and (b) a circular aperture. 
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To make these patterns more understandable, let us determine 
the intensity distribution of scattered radiation* for the simple case 
of an aperture having the shape of a slit. 

Let a wave impinge normally on a slit cut in an opaque screen. 

Considering the slit divided into volume elements AV as shown in 
Fig. 155, let us wrile the expres- 
sion for the wave emanating 
from an arbitrary volume element 
AV, at an angle @ to the incident 
wave. The waves from different 
volume elements AV, arrive at 
the point of observation in differ- 
ent phases. If the path difference 
is measured relative to the outer- 
most ray (on the side of deflec- 
‘tion), the rays emanating from 
the other volume elements will! 
cover a distance that is greater 
by the amount zsin @. Hence, 
these rays will be displaced in phase by = a, sin @. 

The amplitude of the wave scattered by the k-th volume element 
is proportional to the “scattering” volume element AV, i.e., to the 
expression 


Fig. 155 


ny 


It is necessary to take the summation of the expressions for all 
volume elements. Instead of taking the summation we may integrate 
with respect to x, where the z-axis is taken along the width of the 
slit. Replacing AV, by Az,, which is proportional to it, and going 
over to the limit, we obtain for the amplitude of the scattered wave 
at the angle q: 


2: : 
AV;, cos (ot——* x sin °) : 


a 


A=k cos (oe— = x sin °] dz, 


ee k is a coefficient of proportionality and a is the width of 
the slit. 


Introducing the variable 


2x 
ry 


* “Scattered radiation” and “diffracted radiation” have exactly the same 
physical meaning. The terms “diffraction” and “diffracted” are generally used 
when the scattering pattern has rather pronounced maxima and minima. When 
the nature of the interference pattern is not so evident, we speak of “scatter- 
ing”. 


Z=ot———xsing, 
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we obtain 


dz= _ sin pdx 


and, therefore, 
5 9- : 
Ae [ sin ot — sin (ot — — sin e) | ; 


Bail sin 
ry ? 
Designating <* sin by u and performing a trigonometric trans- 


formation, we obtain 
ha. 
A=—~sin u cos (wt — wu). 


Thus, the resulting oscillation at the point of observation has the 


amplitude #2 sin u, i.e., the observed intensity is 
sin®u 
f=iea ae 


This is the formula for the intensity distribution as a function of 


the scattering angle. 
In most diffraction experiments, we are interested in small values 


of the scattering angle @. The reasons for this will become clear 
later. Therefore, replacing sin @ by tan @ and since 


tan g=— 
where x is the distance of the point of observation in the plane of 


the photographic plate to the centre of the diffraction pattern and 
fis the distance from the slit to the plate, we obtain for u the expres- 


Sion 


pss 1a x 
=F: 
ne sin?u : F : om 
Fig. 156 shows the WE curve. Since wu is proportional to z, this cor- 


responds to the diffraction pattern appearing on the plate. 

The locations of the dark fringes are easily determined from the 
condition uw = + nx, where n isa whole number. Thus, the first zero 
occurs at 

peal 
a 
This value of x also represents the distance between two successive 
positions of zero intensity. 

From this formula, we can determine when diffraction phenomena 
may be detected for various wavelengths under various conditions. 
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The diffraction of light (A = 0.5 X 10-1 cm) may be clearly observed 
under laboratory conditions if the aperture is of the order of 0.1 cm 
and the distance between the screen and plate is of the order of 
2 metres. Since for these values x = 1 mm, the effect is clearly 
visible. 

Rays of visible light yield noticeable diffraction from a tennis 
ball (4 = 5 cm), but at a greater distance. If the distance f is equal 


to 100 metres and the wavelength 4 is 5,000 A, then x = 1 mm. 


I 


Fig. 156 


Thus, in this case too the distance between positions of zero intensi- 
ty of the scattered radiation is of the order of magnitude of 1 mm. 
Diffraction of radio waves may also be observed if a proper choice 


of conditions, as determined by the equation z a4 , is made. 


Assume that the values of f and 2 are fixed. The width of the slit 
greatly affects the diffraction pattern. If the width of the slit is large, 
x» 0, i-e., the slit image focussed by the lens is infinitely narrow. 
As the width of the slit is decreased, the diffraction pattern begins 
to take shape and the first diffraction minimum begins to move fur- 
ther and further away from the centre of the pattern. F inally, when 
the slit is made so narrow that our approximation in the formula 
for uw (the substitution of sin p by tan p) is no longer justified, the 
image of the slit on the screen becomes smeared. A still further de- 
crease in width, to the point where the wavelength and the width 
of the slit become equal, results in the slit yielding secondary radia- 
tion as a single source. The interference of primary waves disappears 
and the primary wave is radiated from the slit in all directions. 

For apertures and particles (or inclusions) having other shapes, the 
diffraction patterns appear entirely different (see Fig. 154). Never- 
theless, the general laws remain valid and the basic features of the 
pattern are maintained. Thus, for example, in the eaca of Attecatian 
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from a circular aperture or other circular nonuniformity, concentric 
rings are formed and the diameter of the smallest dark ring is 1.22.40 ; 


where D is the diameter of the aperture. 

Since diffraction patterns have maxima at different locations for 
different wavelengths, white light is resolved into its spectrum upon 
diffraction. Therefore, diffraction from a circular particle or aper- 
ture has the form of a rainbowed ring. 


138. A System of Randomly Distributed Scatterers 


We have considered the behaviour of various secondary radiators 
of electromagnetic waves as a function of the ratio of their dimen- 
sions to the wavelength of the incident wave. But the properties of 
a radiator are only roughly determined by its dimensions. The detailed 
pattern is determined by the distribution of matter in the scat- 
tering particle. Only when the dimensions of a particle are small 
relative to the wavelength is the distribution of matter in the par- 
ticle of no consequence. In this case, the particle scatters as a whole, 
i.e., as a single electric dipole. When this condition is not satisfied, 
the pattern is complex since it is determined by the interference of 
waves scattered from various volume elements of the particle. We 
have considered only one example of a scatterer whose dimensions 
are larger than a wavelength, namely, a homogeneous scattering 
particle, which may take the form of an aperture in an opaque screen. 

Now, let us consider the problem of scattering by systems of 
Particles—e.g., a system of gas molecules, specks of dust or smoke 
particles; a system of hoar-frost crystals on a windowpane; or a sys- 
tem of holes in a piece of gauze. In all such cases, the pattern 
becomes complex due to the fact that the electromagnetic waves 
emanating from the various scatterers may, generally speaking, 
interfere with each other. Now, the scattering pattern will depend 
not only on the properties of a scattering particle, but on the ar- 
rangement of the particles. Thus, it is important to know how close 
the scatterers are to one another and whether their spacing is regu- 
lar or random. Then, depending on the circumstances, the waves 
scattered by the various particles may interfere to a maximum 
extent, partially or not at all. 

Confining ourselves to the extreme cases, let us first consider scat- 
tering by a system of randomly distributed particles—e.g., the 
scattering of X-rays by a large cluster of atoms or molecules having 
a random distribution. 

In the case of a large number of identical scattering centres (e.g., 
atoms, molecules or larger identical particles), the resulting scat- 
tering is determined, as we have already stated, by the scattering of 
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_a single centre (region) and the arrangement of the scattering centres. 
The scattering pattern in the case of uniform distribution of scat- 
tering centres is quite different from the pattern in the case of ran- 
dom distribution. 

If the scattering centres are randomly distributed as, for example, 
in the case of gas molecules, the waves scattered by different centres 
may be considered to be noncoherent. This is because in the case 
of randomly distributed scattering centres the phase relationship 
between waves emanating from different centres is quite arbitrary. 
It is safe to say that the number of waves with positive amplitudes 
arriving at a point of observation (from different centres) will be 
exactly equal to the number of waves with negative amplitudes 
arriving there. The result is clear. Designating the amplitudes of 
the waves from the various centres by A,, Az, Az, etc., the total 
amplitude at the point of observation is 


A=A,+A,+A3-+... 7 


oe the intensity is proportional to the amplitude squared, we 
obtain 


P~ APS AL Abt AP4+ ... 424A, +244, +... +2AAg+ ..-- 


But among the double products there will be just as many positive 
terms as negative terms. Therefore, with a high degree of accuracy, 
the total is given by the sum of the amplitude-squared terms. In 
other words, the total intensity scattcred by identical randomly 
distributed centres isex- 
pressed as follows: 
I=NA*, 
where N is the number of 
scattering centres and A is 
the scattering amplitude of 
one centre. 

Thus, it turns out that 
scattering by a large number 
of randomly distributed par- 
ticles is very similar to scat- 

tering by a single particle. 
Film ‘The only difference is that 
the effect is V times greater. 


on Data on scattering by asin- 

Bigs to! gle molecule are obtained by 

= investigating the scattering of 

X-rays by a gas. Fig. “157 illustrates a laboratory set-up for the study 
of X-ray scattering by gases. A beam of X-rays is made monochromatic 
by reflecting it from a crystal, whereupon it is directed into a gas chamber. 
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Scattering is determined by its action on a photographic film, i.e., the degree 
of blackening is a measure of the scattering intensity. We can thus obtain the 
intensity as a function of scattering angle—rapidly decreasing smooth curves 
for monatomic gases and decreasing curves having small maxima for polya- 
tomic gases. By means of these curves and theoretical formulas, the electron- 
density distribution in a molecule may be determined. 


There are many examples of systems that scatter electromagnetic 
waves as a gas scatters X-rays. 

The scattering of light rays in aroom with dust is a familiar exam- 
ple. Through a chink in a window curtain, a narrow, rectilinear. beam 
of light visible from all sides penetrates a room. Light waves act on 
a system of dust particles in much the same way as X-rays act on 
a system of molecules. The distances between specks of dust are 
quite large and the distribution of the particles is completely random. 
Hence, there is no interference between the waves scattered by 
different dust particles, and the scattering pattern is similar to 
that created by asingle speck of dust. The sole difference lies in greater 
intensity, i.e., the intensity is proportional to the number of dust 
Particles in the field of the primary beam of light. Each speck of 
dust behaves as an elementary electric dipole, for the dimensions 
of such a particle are less than the wavelength of light. Therefore, 
the laws applicable to the scattering of light by dust particles, i.e., 
the dependence on the wavelength of light and the nature of angular 
distribution, are the same as for an elementary electric dipole (i.e.,. 
the intensity formula given on p. 326 and the intensity distribution 
shown in Fig. 133 are applicable here too). is 

This agreement can also be easily demonstrated by comparing the 
diffraction pattern from a single aperture with that from a system of 
randomly distributed apertures. Experiments show that, as regards 
the relative intensity distribution of scattered light, these two 
diffraction patterns are entirely alike. Of course, in the case of N 
apertures, the intensity of the light scattered by the screen is V times 
greater than the intensity of the light scattered by an opaque screen 
having one aperture. 

Since the scattering pattern due to a large number of randomly 
distributed centres has the same form as that due to a single centre, 
it becomes clear why a lantern observed through a window covered 
with hoar-frost has the familiar rainbowed halo. This pattern 1s 
simply the result of diffraction from the ice particles. Since ther 
distribution is perfectly random, they behav eas “circular” particles. 


139. Behaviour of a Perfectly Homogeneous Medium 


In this article we shall consider the other extreme case of the 
phenomenon of scattering. By a perfectly homogeneous medium we 
mean a system of scatterers that are distributed uniformly and 
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continuously throughout a given region. Transparent glass consti- 
tutes such a medium with respect to light waves. Since the wave- 
length of light is considerably greater than the distance between the 
atoms in glass, a piece of transparent glass may be considered to be 
divided into volume elements that are considerably smaller than 
the wavelength, but at the same time still containing a large number 
of molecules. The medium may be considered homogeneous if the 
number of molecules in all such volume elements are approximately 
the same. : 

The scattering of electromagnetic waves is not only a function of 
uniformity in regard to the number of radiators per unit volume but 
is also a function of uniformity in the orientation distribution of 
the radiators. In the final analysis, the scattering power of a body 
is determined by its dipole moment, which is composed of the indi- 
vidual dipole moments of the molecules contained in this volume- 
It may be stated, therefore, that the scattering power of a body is 
determined by the value of the dielectric constant or, since & = 2”, 
by its refractive index. Hence, for a given wavelength, a medium 
that is uniform as regards the scattering of electromagnetic waves 
must also be uniform as regards the refractive index. 

Observations on the behaviour of electromagnetic waves in per- 
fectly homogeneous media show that no scattering takes place in 
such media. Thus, when a ray of light passes through a transparent 
body, the ray cannot be seen from the side (compare this with the 
case of a light ray entering a room with dust).’ 

Each volume element of a homogeneous body is a wavelet source. 
Nevertheless, wave scattering does not occur. Only one explanation 
is possible, namely, the wavelets scattered by a homogeneous medi- 
um in any direction at an angle to the primary ray are completely 
annulled due to interference. This theorem is capable of rigorous 
proof but the proof will be dispensed with since it is quite evident 
that this is the only possible explanation. 

Nevertheless, the phenomenon of scattering does make itself felt 
quite considerably in a homogeneous medium. Scattered waves. annul 
each other in all directions except one, namely, the direction in 
which the primary wave is propagated. Forward scattering does not 
represent simply a superposition on the primary wave but a change 
in its velocity as well. 

It turns out that the phenomenon of electromagnetic wave refrac- 
tion, which we have already discussed, may be interpreted as a natural 
consequence of scattering. 

An electromagnetic wave propagated in a medium may be repre- 
sented as the sum of the primary wave and the scattered waves. 
Theoretical calculations show that the superposition of these waves 
leads to the retardation of the primary wave. 
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140. Scattering in a Nonhomogeneous Medium 


A substance uniformly distributed, in the sense that we have just 
discussed, does not scatter electromagnetic waves. Although all 
portions of this substance create wavelets, secondary radiation is 
not observed from the sides: No matter what point in the region we 
chose as our point of observation, it can be rigorously proved that 
the waves scattered by a uniform substance annul each other due to 
interference. Since for any particular wavelet there always corre- 
sponds another that is exactly opposite in phase, the net effect is 
complete cancellation. ‘ 

Now, let us assume that in some limited region the substance has 
a greater density than in the surrounding medium, i.e., that it has 
an excess of dipoles per unit volume. Then, all wavelets are annulled 
except those created by this excess density. As always, the scattered 
radiation is determined by taking the sum of the wavelet amplitudes, 
whereby the phase differences between the wavelets arriving at 
the point of observation must, of course, be taken into account. 

Is the situation any different if the density of the scattering region 
is less, instead of greater, than the surrounding medium? Scattering 
will cease if matter is added to such a scattering region until the 
medium becomes homogeneous. Clearly, the net effect is the same 
if we add a certain amount to a given quantity or subtract this same 
amount. Hence, the scattering from a region of lower density is equal 
to the scattering from the missing substance, i.e., the substance 
required to make the medium homogeneous. 

Thus, it is only important that the scattering region have a densi- 
ty distribution of matter that differs from the surrounding medium. 
Moreover, as regards scattering, the effect of a positive density 
deviation is indistinguishable from a negative density deviation of 
the same magnitude. For example, the scattering from porous glass 
is the same as the scattering from glass containing randomly dis- 
tributed inclusions of exactly the same size as the pores. 

Due to the large wavelengths of radio waves, scattering will take 
place in this case only when the nonuniformity of the density occurs 
on a relatively large scale. For example, in order for the scattering 
of kilometre waves to be detectable, the extent of the deviations 
from average density that are intercepted by the waves must be at 
least several hundred metres. The waves are unable to “detect” 
smaller inclusions or density gaps. 

The scattering of light waves is detectable when disturbances in 
the distribution of scattering matter are at least of the order of sever- 
al tenths of a micron. Thus, light waves are unable to detect non- 
uniformities in the distribution of electrons in a molecule or in the 
region between two adjacent molecules since these phenomena are 
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restricted to regions whose dimensions are much less than several 
tenths of a micron. The situation is different as regards X-rays. 
In this case, the wavelengths are of the same order of magnitude 
as the dimensions of an atom. Hence, an individual atom appears 
as an “inclusion in a void”. 

The scattering of light waves on nonuniformities is a common 
phenomenon. Nonuniformities in a scattering substance are easily 
recognised by the external appearance of the medium, i.e., by its 
turbid appearance. The conditions needed for light scattering prevail 
in opalescent glass, dust-laden air, etc. In all these cases, there are 
random disturbances of the density of the substance, whereby the 
Seen of the disturbed regions approach the wavelength of 
ight. 

As was indicated above, if a particle or nonuniformity scatters as 
a single dipole, in other words, if its dimensions are no greater than 
one-tenth to one-twentieth of the wavelength, the scattering inten- 
sity as given by the dipole formula (see p. 363) is proportional to 
the frequency, and inversely proportional to the wavelength, taken 
to the fourth power. This is the explanation for the following inter- 
esting phenomenon. When white light is scattered by a medium 
with nonuniformities, the medium acquires a blue coloration since 
the intensity of the scattered blue rays, i.e., those of shortest wave- 
length, is considerably greater than that of the others. On the other 
hand, after passing through a scattering medium, white light becomes 
reddish since the blue portion of the spectrum is impoverished 
due to greater scattering. 

For light waves, not only are turbid media nonhomogeneous. 
A homogeneous gas or liquid is optically nonhomogeneous due to the 
presence of density fluctuations. This may be shown by calcula- 
tions. Light waves scattered in a region whose linear dimension iS 
0.02 micron, i.e., one-twentieth of the wavelength 4, may be con- 
sidered to be in phase. There are, on the average, 215 molecules 
in such a gas volume (8 x 10-18¢m*) under normal conditions. The 
relative fluctuation in the number of particles according to the laws 


of statistical physics is ra , 1.e., approximately 4 per cent. This is 


a perfectly perceptible nonuniformity as the scattering of light by 
air shows. 

The blue colour of the sky is due to such scattering. If the atmos- 
phere did not scatter the light of the Sun, the sky would appear 
black. It should be noted that the colour of the sky is due to the 
scattering of a relatively small portion of the energy: ina unit volume, 
about 10-? of the primary wave energy is scattered. 

Scattering from density fluctuations in a substance’ is referred to 
as molecular scattering since this scattering depends on the molecu- 
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lar structure of the substance rather than its impurities. Investiga- 
tion of the molecular scattering of liquids is of interest as a method 
for the determination of certain features of molecular structure. 
As regards the nature of scattering, a nonhomogeneous medium in 
which the regions of deviation from the average density are suffi- 
ciently far apart and quite randomly distributed does not differ from 
a system of randomly distributed scattering centres (Sec. 138). For 
the most part, however, in discontinuous media (such as fluids and 
amorphous solid bodies in the case of X-rays, opalescent glass and 
colloidal systems in the case of light rays, and the atmosphere in 
the case of radio waves) the interference of waves scattered by neigh- 
bouring regions of lower or higher density affects the form of the 
scattering pattern. This type of interference yields a scattering 
pattern that significantly differs from the ideal scattering pattern 
obtained from a single electric dipole. 

We have considered scattering of electromagnetic waves by 
a system of randomly distributed particles, scattering in a perfectly 
homogeneous medium, and finally, as-an intermediate case, scatter- 
ing in a nonhomogeneous medium. One more important case remains 
to be discussed, namely, scattering of electromagnetic waves by 
systems of uniformly distributed centres. Under this case, the follow- 
ing systems will be considered: a diffraction grating for light waves, 
directional radiators for radio waves and crystals for X-rays. 


141. Diffraction Grating 


A diffraction grating may be constructed using a glass plate coated 
with a thin layer of aluminium. By means of a special device, uni- 
formly spaced lines are inscribed on this plate with a soft ivory tool. 
In such a “grating”, the nonuniformities (lines) are uniformly distrib- 
uted, leading to a number of light scattering peculiarities. 

We shall be speaking about an optical diffraction grating, but the 
discussion applies to any regular distribution of nonuniformities and 
scattering centres and to all electromagnetic waves, i.e., from the 
shortest to the longest (kilometre waves). The discussion will be~ 
restricted to diffraction using parallel rays, which may be realised 
as indicated in Sec. 137. 

If all scattering centres are identical, as is undoubtedly the case 
in an optical diffraction grating, the diffraction pattern may be 
determined in the following manner. 

Consider the amplitude of the wave emerging at an angle @ to the 
incident wave. The total amplitude is equal to the sum of the ampli- 
tudes of the waves scattered by the individual centres. If the waves 
from the individual centres arrived at the point of observation in 
phase, the total amplitude would be equal to the product of the 
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number of centres N and the amplitude of an individual centre f. 
However, the wave from each centre is displaced in phase relative 
to the wave from an adjacent centre, and it may be assumed that the 
magnitude of the displacement is the same in each case. Waves from 
different centres interfere with each other and the resultant intensity 
is equal to Lf? rather than Nf*, where L is a quantity greater than V 
for the directions in which the waves reinforce each other and less 
than N for the directions in which they arrive, in the main, out of 
phase and annul each other. 

The directions for which the waves from all centres reinforce each 
other may be easily determined with the aid of Fig. 158. The path 


Fig. 158 


difference between waves emanating from corresponding points of 
adjacent centres is equal to asin g. If this path difference is equal 
to a whole number of wavelengths, the waves reinforce each other: 
asin = nd (maximum condition). As may be seen, there are 
several such directions. If the wave impinging on the grating is not 
monochromatic, the grating resolves the wave into its spectrum. 
Moreover, there will be several spectra rather than just a single 
spectrum. The number nm appearing in the above equation is called, 
therefore, the order of the spectrum. 

Since it is somewhat cumbersome to calculate the intensity distrib- 
ution of a scattered wave, let us merely confine ourselves to one 
important problem, namely, the determination of the width of a 
diffraction maximum. We are interested in determining how rapid- 
ly the intensity of a diffraction maximum, which occurs at an angle 
@ satisfying the equation asin p=nd, decreases. Does one maximum 
immediately pass over into the next or is there a large interval be- 
tween maxima? The case of a grating consisting of a large number 
of scattering centres (lines) is of considerable practical importance. 
Imagine the grating divided into two parts. Now, compare a pair 


of rays coming from the first centre and the (+ +. 1) st centre, the 


second centre and the ( + 2) nd centre. At maximum reinforcement 


of the waves, the path difference between such pairs of rays is equal 
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to M nh. If the path of the rays is slightly changed, the rays being 
inclined so that the path difference increases by +, maximum 
reinforcement of the superimposed waves is replaced by total annul- 
ment. The first wave cancels the (F -+ 1) st, the second cancels 


the (5 +2) nd, etc. Exact calculations show that for positions 


further away from the maximum the intensity remains practically 
equal to zero until the angle of inclination p approaches the position 


of the next maximum. : 
The angle at which maximum diffraction of n-th order occurs is 


i . 1 
given by the formula sin @ = =. 


If we designate the half-width angle of the maximum by Ag, then 
for the angle (p + Ag) we may write the condition 


Ns. N A 
sz asin(ptAg)=yr+sz- 
Hence, 
é 1 r 
sin(p + Ag) ==" +r 
or 


‘ F x 
sin (p + Ag) —sing = yz 


The distance between two successive maxima is determined by the 
expression 

P 2 a 

sin Po — Sin Qi => 


a 
We see that the half-width of a line is, roughly speaking, a of the 


distance between maxima. When NV is large, i.e., in the case of grat- 
ings consisting of a large number of scattering centres, the diffrac- 
tion lines are extremely narrow and the resolution in the spectrum 
obtained from the grating is very fine. Imagine, for example, that 
light containing two close waves, A and A+ 6A, impinges on a grat- 
ing. For simplicity, assume we are concerned with scattering at 
angles less than 20°; hence, sin g ~ g. Then, in the n-th order these 
two lines are displaced by the angle 6, which, as can be seen from 
the condition 9 & sing = mh is approximately equal to =. 6. 
The width of the maximum for each wave may be determined from 
the equation , 

A 


sin (9+ 69)—sing ~ sQ=sr- 
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Evidently, these two lines may be distinguished (in optics, we 
say resolved) if 


n a 
q > Na * 


The expression x = nN is an indicator of the resolving power of 
the grating. 


Example. In a good diffraction grating, the distance a between lines is 
a~10-3mm and the number of lines NW is equal to 100,000. Then, the resolving 


power for the spectrum of second order is + = nN = 200,000. This means 


that for 4= 6,000 A, for example, two lines whose wavelength difference is 
0.03 A may be resolved. 


Let us consider the intensity of a diffracted beam. The waves 
directed towards a maximum point act in phase. If f is the amplitude 
of the wave scattered by one centre, the total amplitude in the maxi- 
mum direction is Vf and the intensity is N?f?. Thus, the height of 
a diffraction maximum is proportional to the number of scattering 
centres squared and, since the width of a maximum is inversely pro- 
portional to N, its area (i.e., the integral of the maximum intensity) 
is proportional to N taken to the first power. If different maxima are 
compared, it will be seen that the ratio of their heights (or, what 
amounts to the same, their areas) depends on the value for these 
directions of the amplitude f of the scattering from one centre. 

Thus, the period of a grating determines the locations of the maxi- 
ma, while the form (in the broad sense of the word) of a line or scat- 
tering centre determines the intensity of the maxima. 

Let us assume the angles qi, 2, @3, etc., are determined by the 
period of a grating. Scattered rays occur only at these angles. But 
what will be the intensity of these rays for the first, second, etc., 
orders of diffraction? This depends on the amplitude values of one 
scattering centre for these scattering angles. Thus, for example, the 
amplitude f may be a maximum at the angle 2. Then, the second 
order diffraction will be represented by an intense line. If at the 
angle @3, the amplitude f is close to zero, then the third order line 
will not appear in the diffraction spectrum, etc. This is illustrated 
in Fig. 159, which shows the diffraction spectra and scattering fac- 
tors f of one centre (dotted curves) for two different grating arrange- 
ments. ; 

These principles form the basis of any study of structure by means 
of diffraction spectra. The distance between diffraction lines enables 
us to determine the period of the grating—assuming, of course, that 
the wavelength is known—and the intensity of lines of different order 
enables us to determine the structure of a scattering centre. 
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Example. Consider a diffraction grating for which a = 3 x 10-3 mm and 


N = 1,000. Assume a parallel monochromatic beam of light (A = 5,000 A) 
impinges on this grating. Diffraction maxima are visible at the angles given by 
n 


sin == and the width of a diffraction maximum is equal to 


r 1 F 
2a = 3,000 ° The obtained results are valid for any scattering centre 


form. To calculate the relative intensity of the diffraction maxima, it is 
necessary to consider specific scattering centres. Let us examine two cases: 


1. The scattering centres are single strips of width bat a=0.75 x 
$3 


x40-3 mm (Fig. 1592). In Sec. 137, we obtained a formula for the inten- 
sity of a wave diffracted at a slit. The magnitude of the intensity is pro- 
portional to the amplitude squared of the scattering from one strip: {2 ~ 

sin’ , : ; F , ; 
~ — , where w= sing. The intensity of the n-th diffraction maxi- 


mum is determined by the magnitude of f?, in the direction @p, which is 


determined, in turn, from the equation sin m= . If f is taken as 100, 
the other intensities have the following values: 


{= 80; #3 =40; /3=8.9; f2=0; f2=3.2. 


2. The scattering centres are double [strips of width fia 0175S 


x10-3 mm each. The grating period a is the same as before (Fig. 1590). 
Clearly, the location and width of the diffraction maxima have not changed. 
Caveulations similar to those performed in Sec. 137 show that for two 
slits 


fa ~ Sint» {1400s [Fete sinjg |} 


From this expression, the relative intensities of the diffraction maxima are 
easily determined. Assuming, as before, that f% = 100, then 


i= 12; fg=20; [f= 7.5; f= 0; (2=2.7. 


142. Directed Radiators of Radio Waves 


In certain radio engineering applications, particularly radar, 
it is required to direct a radio beam into space in such a manner 
that the transmitted energy is concentrated in a very small solid 
angle. One solution of this problem is to utilise a linear array of 
antennas. 

In Sec. 141, we saw that for uniform spacing of scattering centres 
the radiated energy is concentrated in specific directions. If radia- 
tors of radio waves are arranged in a single row (see Fig. 160) with 
a distance a between adjacent antennas, and if all the antennas are 
fed synchronously, such a radiator array will in no way differ from 
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a scattering diffraction grating. The fact that we are dealing here 
with primary waves in no way detracts from the applicability of 
ihe discussion in the preceding article to the present case. It is only 
necessary that it be permissible to consider the radiation from differ- 
ent antennas coherent, which is the case if the antennas are fed 
synchronously from a generator of oscillations. 

If the antenna array is so dense that the distance between adjacent 
dipoles is less than a wavelength, even first, order diffraction is 


Fig. 160 


impossible, as the equation a sin p = nA shows. Only the zero order 
remains. This means that there are only two radiation maxima— 
one forms a 0° angle with the normal to the array and the other 
a 180° angle. The point made in Sec. 141 about the width of the 
maximum is valid here too, namely, the larger the total number 
of radiators, the smaller the solid angle in which the beam intensity 
reaches an appreciable value. &. 

However, in practice, it is inconvenient to haye radiation of equal 
intensity in two opposite directions. To avoid this, double arrays 
of dipoles are used (see Fig. 161). The antennas of each dipole pair 
are separated by a distance of 4/4 and a phase difference of 90° exists 
between the two currents. As a result, one of the two maxima is 
annulled and all the energy is concentrated into one diffraction 
maximum. The phase relationships existing for each dipole pair in 
such an array may be explained as follows. For the “forward” waves: 
if the waves were propagated synchronously, the path difference 
between them would be 4/4; but the antennas do not operate synchro- 
nously, i.e., the wave radiated by the “front” dipole lags by 90°, 
which compensates for its lead due to the path difference. The situa- 
tion is different for waves propagated “backwards”. The displace- 
ments due to the path difference and the phase difference between 
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the currents in the antennas are in the same direction. Hence, the 
total phase difference is 180°. As a result, there is no radiation in the 
backward direction. 


Fig. 162 


If the antennas are arranged in a row, a narrow beam may be 
obtained in the plane perpendicular to the antennas. But if a small 
beam in space is desired, a more complex system of oscillator ele- 
ments must be used. That is why radar antennas have such strange 
appearances (see Fig. 162). 


CHAPTER XXII 
DIFFRACTION OF X-RAYS BY CRYSTALS* 


143. Crystals as Diffraction Gratings 


As has been already indicated, a diffraction grating usually con- 
sists of a piece of glass on which equally spaced lines are scratched. 
What is essential here to obtain one of the typical diffraction patterns 
discussed above? Is it the presence of glass, the shape of the lines, 
the thickness of the glass, or the width of the “slits”? A careful study 
of Sec. 141 shows that the essential element is the periodic repetition 
of the nonuniformities of the scattering substance. Thus, irrespective 
of the cause of the scattering and the nature of the nonuniformities, 
as long as these nonuniformities are repeated with a periodicity a, 
Scattering maxima will occur at angles @ satisfying the equation 
asin pm = na. Such a pattern is given by lines of any shape scratched 
on any glass, or slits of any shape made in any screen. It is only 
necessary that the distribution of matter repeat with a periodicity a. 

To be sure, certain differences in patterns may exist. The inten- 
Sities of rays diffracted in different orders may differ depending 
on the shape of the slit. The distribution of substance within a repeat- 
ing nonuniformity affects the scattering intensity f°, which for 
different orders may have different values. 

Now that we have reviewed the results of Sec. 141, let us turn 
to crystals. The ‘basic feature of crystals distinguishing them from 
other bodies is the periodic distribution of matter. Along any 
direction of a crystal, the time-averaged electron density is periodi- 
cally repeated. In the simplest case, the electron density distribu- 
tion will appear as shown in Fig. 163. This figure shows the electron 
density (the number of electrons per cubic Angstrom) along a line 
parallel to the edge of a cube of rock salt. An electron density max- 
imum corresponds to the centre of an atom. The large maximum 
corresponds to the centre of a chlorine atom and the small maximum 
to that of a sodium atom. The pattern repeats itself after every oth- 
er atom, and the period of the electron density distribution along 
the line is equal to 5.6 A. This is the pattern of the distribution 
obtained along a specific line. Along a slightly displaced parallel 
line, the distribution will be different. 


* Before reading Chapters XXII and XXIII, it is recommended that Chap- 
ter XXXII be perused. 
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However, a crystal is a three-dimensional formation, and the 
repeating element is a three-dimensional cell. The electron density 
distribution of a cell cannot be illustrated graphically, but it is 
sufficient to know that the cells repeat in space. The similarities 
and differences between a crystal and a diffraction grating are evi- 
dent. A crystal is a three-dimensional diffraction grating in which 
the nonhomogeneous element repeats regularly in three-dimensions 


Flectron density 
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Fig. 163 


rather than along a line. The role of “slit”, i.e., repeating nonuni- 
formity, is played by the unit cell of the crystal. 
Let us determine the nature of the diffraction pattern created 
by a crystal. ” 
X-rays are scattered by electrons. The nonuniformities in the 
electron density are of such a nature that wavelengths of the order 


of i1-2 A yield perceptible diffraction. In order to determine the 
direction of the diffracted rays, the wavelets coming from all the 
cells must be added. Of course, the amplitudes of these wavelets 
for a given direction are the same. The difficulty arises in taking 
account of the phase differences between wavelets scattered by 
individual cells. These wavelets must be added for every direction 
and the directions in which the wavelets reinforce each other to 
a maximum extent determined. 

The problem can be solved in various ways since various summation 
sequences are possible. For example, first wavelets from the 
cells along edge a may be added, then wavelets from all columns 
in the plane ab, etc. But we shall use a much simpler method. This 
is the method proposed by the founders of X-ray structural analysis, 
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the two Braggs—father and son, British scientists. (The same idea 
was proposed independently by the Russian crystallographer Vulf.) 
In a crystal, parallel planes can always be passed through the lattice 
nodes in numerous ways; the crystal is composed of the layers between 
such successive planes. Let us construct a normal to these layers’ 
and imagine the electron density projected onto the direction of the 
normal. Clearly, a periodic 
electron density distribution 
exists along the normal. 
The period d is appropriate- 
ly called the interplanar 
distance. 

The condition for maxi- 
mum reinforcement of waves 
scattered by the cells of one 
layer is that the angle of 
incidence equals the angle 
of reflection. Thisconclusion — ~— — | — —|——7—-p— 
is based on Huygen’s 
principle, for only when the 
above condition is satisfied Fig. 164 
are the scattered waves 
propagated in phase, and thus reinforced. Waves of successive layers 
reinforce each other when certain additional conditions are met. 
Fig. 164 shows that the path difference between rays “reflected” 
from the corresponding elements of two adjacent layers is equal 
to 2d sin 0. Thus, diffracted rays are obtained when the condition 


2d sin8 =na 


is satisfied. 

A diffracted beam is obtained when a system of planes may be 
found, among the countless such systems into which the crystal may 
be divided, which satisfies the condition 2d sin 0 = nd. Of course, 
this requirement may simultaneously be met by several systems of 
planes. However, the more likely situation is that diffraction does 
not occur for an arbitrary direction of a monochromatic beam. 
Hence, in order to observe diffraction, the crystal must be turned 
until a suitable angle ® is found 


Example. The interplanar distance in a calcite crystal is equal to 3.029 A. 
In X-ray structural analysis, radiation from a copper anode is often used. 


Since this radiation has a wavelength of 1.54 A the diffraction maximum of 


first order occurs at 0 = arcsin oa © 14°40’, 


25—1409 
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amplitude of a cell is expressed by 


d 
F= \ 0 cos (wt + @) dz. 


~ 


the interplanar distance); 


( 


Fig. 167 
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The integral is taken over a single period 
the values of p and @ for each z differ. 
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We shall not concern ourselves with the problem of determining 
the electron density of a crystal at all points of a unit cell. In all 
cases the basis of the solution is the above equation. It enables us 
to calculate the amplitude of any diffracted beam if the electron 
density of the crystals is known. Of interest, however, is the 
converse problem—the determination of the electron distribution in 
a crystal from the experimentally determined intensities of diffract- 
ed beams. This problem has been solved for very complex crystals 
containing hundreds of atoms per unit cell. 

In the case of anthracene crystals, the intensity of about 600 
diffracted beams has been measured. Using these data, the values 
of the electron density at all points of such a cell were determined. 
Fig. 167 illustrates the electron density for a cross-section through 
the centres of atoms of an anthracene molecule. (The method adopt- 
ed to indicate the electron density is that used by topographers to 
indicate elevation. Electron density contour lines on the electron 
density diagram correspond to elevation contour lines on a topo- 
graphical map.) The fourteen distinct peaks represent fourteen hydro- 
gen atoms. Experiments show that the distance between adjacent 
peaks is 1.4 A. The height of an electron density maximum is pro- 
portional to the number of electrons in the atom. Carbon atoms 
have six electrons each and hydrogen atoms one. Not all of the ten 
peaks corresponding to the centres of the ten hydrogen atoms con- 
tained in an anthracene molecule are fully sketched in the diagram. 
The chemical formula of anthracene is Cy,Hio. 
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The measurement of the angles 20 formed between the diffracted 
beams and the beam incident on a crystal, as well as the intensities 
of the diffracted beams, may be accomplished using an ionisation 
chamber (see p. 532) or a photographic method. A photographic 
film on which the traces of many diffracted beams are simultaneously 
recorded is called a roentgenogram. 

But how can the traces of several beams be obtained on a single 
film when, as has been already indicated, the condition nA =2d sinO 
in all likelihood will not be fulfilled even once for an arbitrary orien- 
tation of the crystal relative to the beam? This can be accomplished 
in three ways: 

1) by rotating the crystal, thereby setting various systems of 
planes in a reflecting position; 

2) by illuminating the crystal with a continuous spectrum of 
wavelengths in a band sufficiently broad so that almost every system 
of planes finds a “suitable” wavelength in the spectrum; and 
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3) by obtaining a roentgenogram of a powder, since in a powder 
the diffraction condition for any d is always fulfilled forsomo crystals. 
The first method is known as the crystal rotation method, the 
second as the Laue method, after the German physicist whose name 
is associated with the discovery of beam diffraction, and the third 
as the powder: or Debye method, after the scientist who proposed 
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Fig. 168 


this method. The Laue method has extremely limited application. 
In practice, the rotation method is used for the investigation of crys- 
tal structure, i.e., for the determination of arrangement of atoms 
and the Debye method is used for special problems arising in connec- 
tion with the investigation of fine crystalline substances. 

The purpose of a rotating roentgenogram is to gather data on 
one film regarding existing interplanar distances and intensities 
of the respective beams. However, it is also necessary to know the 
orientation of the system of planes relative to the crystal axes. This 
requires knowledge not only of the location of a particular spot 
on the film, but also of the instantaneous orientation of the crystal 
when it arose. In order for the roentgenogram to provide this infor- 

-mation as well, the film is displaced during filming. Such filming 
methods are known as roentgengoniometric methods. 

Fine crystalline substances are used much more often than mon- 
ocrystals for the investigation of crystal structure by means of 
X-rays. Fig. 168 illustrates how a roentgenogram is obtained by 
the Debye or powder method. : 
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Let us direct our attention to a specific system of planes sepa- 
rated from one another by a distance d and having a normal 2. 
Assume that the normals of the crystals are directed in all direc- 
tions. A beam “reflected” from a plane is not produced by reflection 
from all the crystals, but only from those whose planes are at an 
angle @ to the beam, where % satisfies the condition 2d sind = nd. 
Accordingly, the normals of these crystals are at an angle of 90°-6 
to the primary beam. All crystals whose normals lie on the cone 
having an apex angle 2 (90°-6) are in a reflecting position for planes 
separated from one another by the distance d. Therefore, the “reflect- 
ed” beams also form a cone, and this cone has an apex angle of 40. 
Ring patterns are obtained when these cones are intersected by a 
photographic film or plate. 

If we are concerned with values of @ not exceeding 20-25°, the 
roentgenogram may be taken on a flat plate. In this case, we obtain 
a system of concentric rings. However, if information on all inter- 
planar distances is desired, and scattering over the entire interval 
of angles possible is to be analysed, cameras with cylindrical film 
are used. In this case, the height of the film is reduced, so that only 
parts of the rings are photographed. 

If for some problem or other it is important to ascertain the inter- 
planar distances more accurately, we usually resort to “rear” film- 
ing, i.e., an arrangement whereby diffraction cones having apex 
angles close to 360° are filmed. 

In determining the scattering angle 6 from the film, a measure- 
ment error AO inevitably arises. Let us see how the magnitude of 
this error is reflected in the determination of the magnitude of the 
interplanar distance. After differentiating the diffraction condition 


2d sin? = nd, we obtain 
a 


It is seen that the accuracy in measuring the interplanar distance 
rapidly increases as the angle @ approaches 90°. The diffraction 
angle may be easily measured with an accuracy of the order of 0.1°. 
Hence, the above equation shows that along lines for which the 
angle } is equal to 65°, 75° and 85° the interplanar distance may be 
measured with an accuracy of 0.13%, 0.08% and 0.05%, respective- 
ly. Using special cameras, the method of rear filming yields very 
good results, enabling interplanar distances to be determined with 
an accuracy of 0.00001 A. For best results, the radiation wavelength 
is selected so that the scattering angle approaches 90°. 

All three types of Debye filming are widely used in the investi- 
gation of the structure of matter. Every substance yields a specific 
system of lines that is characteristic only of it. A phase transforma- 
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cules is not necessarily manifested. Cubic crystals possess isotropic 
polarisability. Here, the equation P = aH remains valid, just as 
for isotropic bodies. To make this clear, let us consider molecules 
whose electrons may be displaced only in a single direction. The 
symmetry of a cubic crystal is such that molecules whose axes of 
polarisability form right angles may always be found in the crystal. 
Let us consider three such molecules whose axes of polarisability 
lie along the z, y, z axes of a Cartesian system of coordinates. When 
an arbitrarily directed field EF is 
applied, these molecules become 
polarised and produce dipoles of 
moments BE cos a, BE cos G2 
and BE cos a3, since the moment is 
proportional to the projection 
of KH on the polarisation direction. 
The resultant of these three dipole 
Moments is obtained by vector 
addition. But a1, a, a3 are the 
angles formed by the vector HZ with 
the coordinate axes. Hence, the mag- 
nitude of the resultant moment is 
EV B? (cos? a1+- cos? a, + cos? a3) =BE 
and the direction is parallel to Z. 
Taking the summation for all the 
molecules, we arrive at the same conclusion as regards the pola- 
risation vector P and the polarisability a per unit volume. 

Now, let us consider crystals having one main axis, i.e., crystals 
with tetragonal and hexagonal syngony. To be specific, let us 
consider the former, i.e., crystals in which each molecule has three 
identical molecules related to it via an axis of symmetry of the 
fourth order. Assume, moreover—again for purposes of simplicity— 
that the molecule can be polarised only along one axis. Let us direct 
our attention to the four molecules whose axes of polarisability 
form an angle e with the main axis (see Fig. 169). How do these 
molecules behave in electric fields of various directions? If the vec- 
tor £ is directed along the main axis, the polarisation is propor- 
tional to cos e. Moreover, in view of the symmetry of the arrange- 
ment, the resultant dipole moment of these molecules is parallel 
to £; hence, the polarisation vector is also parallel to E: 


P=a£, 


where aq is the polarisability created by all the molecules for this 
direction of the field. 

In the projection onto the plane perpendicular to the main axis, 
the polarisability axes form right angles with each other. Therefore, 


Fig. 169 
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the result obtained for a cubic crystal is valid here, namely, the 
polarisability is the same for all directions of FZ in the plane perpen- 
dicular to the main axis. If H is perpendicular to the main axis, 
the polarisation vector P is again parallel to Z: 

P=aleE. 


While the polarisability of the molecules along the axis is pro- 
portional to cos €, the polarisability of the molecules in the direc- 
tion perpendicular to the axis is proportional to sine. This means 
that a and a, are different. 

What is the situation when the field FE is inclined to the main 
axis of the crystal? In view of the difference in the polarisabilities 


Fig. 170 


% and @,, the vector P can no longer coincide with the direction 
of the field, and the value of a will also be different. Knowing a, 
and @,, « may be calculated for any direction. We shall not explain 
how this is done in the general case, but merely cite a numerical 
example. 

For a crystal of Iceland spar (calcite), a, = 0.139 and a, = 0.095. 
Assume that the vector # forms a 30° angle with the plane perpen- 
dicular to the main axis and that it is directed as shown in Fig. 170. 
The polarisation vector in the indicated plane is then 


P, =a ,E, =0.139 x Ecos 30° =0.120 E. 


The polarisation vector perpendicular to this and parallel to the 
main axis is 


Pi = on, E =0.095 x E sin 30° = 0.0475 EB. 


Therefore, the angle between the resultant polarisation vector P 


: 0.0475 , : 
and the plane is arctan [755 ~ 21°40’. This means thas the angle 
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between P and FE is ~ 8°20’. The magnitude of the polarisation 
vector is P= /P% + Pi, = 0.129 £, i.e., in this case the polari- 
Sability a@ is equal to 0.129. 

If # is directed at a 45° angle to the main axis, the polarisability 
a decreases even more relative to a, and becomes equal to 0.120. 
The angle between F and P is then 


0.095 
0.139 


The direction of the vector # in Fig. 170 is such that the angles 
formed by this vector with the polarisability axes of the molecules 
differ. Thus, the angles are greater for the pair of molecules on the 
left. As a result, the right pair of molecules is polarised to a 
greater extent. 

The fact that the polarisability of a crystal possesses an axis 
of symmetry of the fourth order does not signify symmetry in the 
contributions of individual molecules to the magnitude of the 
polarisability for an arbitrary field direction. 

Thus, the values of the polarisabilities differ for different direc- 
tions. This has important consequences: The polarisability is unique- 
ly related to the dielectric constant; but e determines the index 
of refraction (see Sec. 125; « = n®) and hence the wave propagation 
velocity in the crystal; as a result, the electromagnetic wave is 
propagated in the crystal with different velocities in different 
directions. 

Tetragonal and hexagonal crystals (in optics they go under. the 

heading “uniaxial”) possess the following characteristic: All orien- 
tations obtained by successive rotation about the main axis are 
optically equivalent. Crystals having a lower order of symmetry 
do not possess this feature. 
; Uniaxial crystals have a main direction and, perpendicular to 
it, a main plane. If the vector H points in this direction or lies in 
this plane, then P || £ (and, therefore, D || Z). Analysis shows 
that in other crystals only three main mutually perpendicular direc- 
tions in which P || E may be distinguished. 


45° — arctan = 10°30’. 
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Division of a Light Field into Two Waves. We shall restrict 
ourselves to the study of phenomena occurring when light is inci- 
dent on the face of a crystal cut in two different ways, namely, per- 
pendicular to the main axis and parallel to the main axis. 

Light propagated along the main axis differs in no way from 
light propagated in isotropic bodies. The electric vector produces 
polarised oscillations of the dipoles in the direction perpendicular 
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to the main axis. Hence, the wave is propagated with the velocity 
Up = ne where no = V €,; &, is the dielectric constant for the 


direction perpendicular to the axis. The designations mp and vo indi- 
cate that the index of refraction and the velocity of light are 
“ordinary”. 

Recalling that e=1+4na, we find that for Iceland spar, myp= 
= Vi-4na) = 1.658. Hence, v9=1.81 1010 cm/sec. 


The passage of light through such a crystal in the direction of 
the main axis does not change its polarised state. Natural light 
remains natural, and the oscillation direction of the electric vector 
for a polarised wave does not change. 

The simplicity of the case considered is characteristic of a uni- 
axial crystal. Here, any polarised state of an incident wave is capable 
of exciting oscillations in the direction perpendicular to the main 
axis. Hence, the polarisability of the molecules (also & and 7) is 
the same for all such oscillations. 

Now, let us consider the case of normal beam incidence on the 
face parallel to the main axis. 

Different polarised waves behave differently. Consider the behay- 
iour of a linearly polarised beam. If the electric vector is perpendic- 
ular to the axis, the light is propagated with the same velocity ug 
as in the preceding case. But if the electric vector is parallel to 
the axis, the polarisation of the dipoles occurs along an axis for 
which the dielectric constant has another value, namely, e. There- 
fore, for this propagation direction, the velocity and the index of 


: Cc — 
refraction have other values, namely, ve = — and Me = V eq, 


e 
respectively. The designations me and ve indicate that the index of 
refraction and the velocity of light are “extraordinary”. The reasons 
for the above designations will become evident below. 
Crystals for which ve < vo are called optically positive; on the 
other hand, those for which ve > vp are called optically negative. 


For Iceland spar, nes V1-+4na i =1.486 and veg=2.02x1010 cm/sec. 
Iccland spar is an optically negative crystal since ve > v9. 


Elliptical Polarisation. What is the situation when the electric 
vector EH of the wave incident on the face of the crystal forms an 
angle @ with the direction of the main axis (Fig. 171)? Experi- 
ments show, and this may be predicted from Maxwell’s equations, 
that the electromagnetic wave becomes divided into two parts. The 
vector £ must be resolved into the components £ sin g and E cos @. 
The first corresponds to a wave travelling with the velocity v9, and 
the second to a wave travelling with the velocity ve. This may be 
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shown by determining the path difference between the two waves. 
that are created upon division of the incident wave. Designating 
the thickness of the crystal by 7, the phase difference can be ex- 


pressed as 6 = a (%e—N9). 


Thus, the polarised state of the wave leaving the crystal has 
changed significantly. The wave incident on the crystal was linearly 
polarised, while the one leaving is a combination of two waves 
having mutually perpendicular oscillation directions and displaced 
relative to each other by 6. What is the nature of this peculiar po- 
larised state? Such light is said to be 
elliptically polarised since the terminus 
of the electric vector describes an ellip- 
tical spiral. If the electric vector of one 
of the waves is given by 


E, = E sin 9 cos wt, 
then for the other wave the electromag- 


netic oscillation in the plane perpen- 
dicular to the beam will have the form 


E, = E cos @ cos (wt + 8). 


The addition of such oscillations has 
Ries been considered earlier (see p. 108). It 
B- was seen that the point describing two 
such oscillations is an ellipse. The same applies to the terminus of 
the electric vector, but, since the wave is advancing, the terminus 
of the vector EF describes an ellipse in its projection on the plane 
perpendicular to the beam. In space, the terminus of vector 
describes an elliptical spiral winding about the beam axis. 

To obtain circularly polarised light by this method, a “quarter- 
wave plate” is used. This is a plate producing a path difference of 
4/4 between waves travelling with velocities vp and ve. The thick- 
ness of such a plate must satisfy the equation 


Qn It 
Sl (no — ne) = oe + mn. 
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If a linearly polarised beam impinges on such a plate so that the vec- 
tor forms a 45° angle with the direction of the main axis of the 
crystal, resolution of this vector yields 
E EE 
Se =—=cosm and E, =~——sinwt, 
me FR u 73 
Ei+ E}= > B. 


But this is the equation of a circle. Thus, the described experimen- 


148. Propagation of Light tn Uniaxial Crystals 399 


tal conditions lead to the transformation of linearly polarised light 
into circularly polarised light. - 

Double Refraction. The apparent bifurcation of objects viewed 
through a transparent crystal is a phenomenon that has been well 
known for a long time. It shows that division into two waves may 
occur not only as regards the propagation velocities, but also as 
regards the beam directions in space. Double refraction occurs for 
normal light incidence on a crystal face (ground or naturally 
formed), which is at an angle to the optic axis. The phenomenon may 
also be investigated by means of a plate cut parallel to the axis. 
In this case, the light must be incident at an angle to the normal. 
We shall direct our attention to the latter case. Let us introduce 
another restriction, namely, that the beam be directed in such 
a manner that the plane of incidence of the light is perpendicular 
to the optic axis. 

Assume that a polarised beam impinges on the plate at an angle 
i. By turning the beam about its axis, the position of the electric 
vector relative to the plane of incidence is changed. 

When the electric vector coincides with the incident plane (see 
Fig. 172a), no special effects are noted. Refraction occurs in ac- 


cordance with the law for isotropic bodies, namely, Se 

The refractive index turns out to be 7m. This is as it should be, 
for the electric vector is perpendicular to the main axis of the crys- 
tal. When the beam is turned 90° about its axis (see Fig. 1720), 
it is also refracted. But now Cie = Me, i.e., the refraction angle 
is different and the index of refraction is that for an extraordinary 
beam. This too is natural, for the vector E coincides with the direc- 
tion of the main axis. 

Most remarkable is that an intermediate position does not yield 
a beam with an intermediate angle of refraction, but yields rather 
two beams—an ordinary and an extraordinary beam having refrac- 
tive indexes 79 and me, respectively. As before, the field intensity 
vector is resolved into two vectors, one lying along the main axis. 
and the other perpendicular to it. Each component creates its own 
field, or wave. In turning the beam of light about its axis, the inten- 
sities of these two beams continuously change; when one beam 
decreases in intensity, the other increases. 

Since the beams are refracted twice, i.e., in entering and leaving 
the plate, the ordinary and extraordinary beams emerge parallel 
to each other. The thicker the plate, the greater the separation 
between beams. If a narrow heam of incident light is used, the 
difference between the refractive indexes may be determined by: 
measuring the beam displacements. 


= No. 
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Now we can explain why the beams are called “ordinary” and 
“extraordinary”. Let us begin turning a crystal plate, whose optic 
axis is parallel to the face, about the normal to the reflecting face. 
If we were dealing with an isotropic body, such rotation could not 
affect reflection and refraction. When we rotate the crystal plate as 
indicated, nothing happens to one beam, i-e., its position in space 
and its intensity remain unchanged. That is how an ordinary beam 
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behaves. It is understandable, therefore, that the beam whose elec- 
tric vector is perpendicular to the main axis of the crystal is called 
ordinary. In this experiment, the electric vector component lying 
in the plane of incidence is always perpendicular to the main axis 
of the crystal. This component acts in an “ordinary” manner. On 
the other hand, the component of EH perpendicular to the plane 
of incidence forms an angle with the main axis of the crystal that 
varies as the crystal is rotated. During such rotation, not only 
does the extraordinary beam’s intensity vary, but its position 
in space varies as well. We see that the extraordinary beam does 
not obey the laws pertaining to isotropic bodies. In the general 
case, the refracted beam is not in the plane of incidence. 
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We shall not go into the rather complex explanation for these 
phenomena. It should be noted however, that these phenomena are 
in complete accord with Maxwell’s electromagnetic field theory. 


149. Polarisers. Investigation of the Polarised 
State of Light 


It was stated on p. 336 that a dielectric placed with its plane 
surface at an angle p, to the incident beam may serve asa_polaris- 
er. In this case, the reflected beam is completely polarised and 
the refracted beam is polarised to the maximum extent possible. 
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However, it is not convenient to usea reflecting plate as a polaris- 
er, for the polarised beam travels at an angle to the incident beam. 
A stack of glass plates may be used instead. By repeated refraction, 


Fig. 174 


-an almost completely polarised beam may be obtained. However, 
a considerable portion of the light is absorbed in such a device 
(see Fig. 173). 

The best kind of polariser is a crystal in which the linearly polar- 
ised ordinary (or extraordinary) beam may be separated out by 
means of double refraction. Such polarisers are known as Nicol 
prisms, or simply Nicols. 

The polariser proposed by the French scientist Nicol consists 
of two right-angled prisms made of Iceland spar (Fig. 174). These 
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prisms are glued together with Canada balsam, a substance whose 
refractive index n is 1,550. This value lies between mp and n- for 
Iceland spar. A nonpolarised beam of light impinging on the prism 
is divided into two components. The ordinary beam is reflected 
at the boundary between the prisms, where the condition for ils 
total reflection is satisfied, and is deflected to one side. The extra- 
ordinary beam passes through both prisms. Thus, a Nicol acts as 
a slit passing only electric vector oscillations directed in a specific 
direction. 

Pleochroism is of great practical importance in polariser appli- 
cations. This term is used to indicate that the ordinary and extra- 
ordinary beams are absorbed differently and that the absorption 
coefficient of the extraordinary beam is a function of the direction 
relative to the optic axis. A pleochromatic crystal gives a different 
hue and absorbs light differently when it is turned relative to 
the beam. 

Tourmaline is a classical example of a pleochromatic crystal. 
The absorption coefficient for the ordinary beam over almost the 
entire visible spectrum is so great that a tourmaline plate of 4 mm 
thickness, cut parallel to the optic axis, transmits in effect only the 
extraordinary beam and, hence, may serve as a polariser. However, 
the yellowish-green hue of the transmitted light prevents tourma- 
line from being used in practice as a polariser. 

Polaroids, synthetic pleochromatic films, have wide applica- 
tion and may be prepared from herapathite (quinine sulphate perio- 
dide), a strongly pleochromatic substance. A polaroid is a trans- 
parent plastic film consisting of submicroscopic crystalline hera- 
pathite needles oriented in a single direction. To orient the crystals, 
a viscous mass of the crystals is placed between two glass plates 
and subjected to reciprocating motion. To be sure, herapathite is 
not the only substance suitable for the production of polaroids. 

Iodine atoms, constituents of herapathite, are essential to secure 
pleochromatic properties. Pure iodine polaroids may be produced 
by iodising thin films of polyvinyl alcohol. 

The polarised state of light may be investigated with the aid 
of two Nicol prisms, or other polarisers, and a “quarter-wave plate”. 
A 0.038 mm-thick sheet of mica may serve as the quarter-wave 
plate. Let us consider the passage of light through two Nicol 
prisms. In order to distinguish between the two, the first prism in the 
path of the beam is called the polariser and the second the analyser 
(see Fig. 175). If a beam of natural light of intensity J) impinges on 


the polariser, a linearly polarised light of intensity +. I, emerges 


from the prism. Naturally, turning the polariser about its axis in 
no way changes the intensity of the transmitted beam. By means 
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of the analyser, it may be shown that the Nicol prism has indeed 
transmitted a linearly polarised beam. If the Nicol prisms are 
oriented with their “slits” parallel to each other, the light will 
be transmitted through the analyser as well without change in 
intensity (disregarding absorption in the material of the polarising 


device). For crossed prisms, i.e., when the “slits” are at right angles 
lo each other, light is not transmitted (see Fig. 175). The intensity 
of the light when an angle a is formed between “slits” is J = 
= 1/,J cos*a. Thus, the electric vector of the wave arriving at the 
analyser may be resolved into two com- 
ponents—one parallel to the “slit” and 
the other perpendicular to it. Since the 
component that is passed is E cos a (see 
Fig. 176), the intensity is proportional to o 
cos*a. 

The first prism will already reveal 
whether or not the light was partially or 
completely polarised. 

Using two Nicol prisms, it is not pos- 
sible to distinguish circularly polarised 
light from natural light, and elliptically polarised light from par- 
tially polarised natural light. To do this, the quarter-wave plate 
may be used. If it is placed before the polariser, it in no way affects 
naturally polarised light, but circularly polarised light is trans- 
formed into linearly polarised light. Similarly, a quarter-wave plate 
changes the properties of elliptically polarised light. 


| 
|fpass = Ecos 
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Fig. 176 


150. A Crystal Plate Between “Crossed” Nicol Prisms 


The method of investigating transparent anisotropic substances 
by observing the behaviour of linearly polarised light impinging 
on them is very widespread. In order not to complicate the problem, 
assume that we are dealing with a crystal plate cut parallel to the 
optic axis. This plate is placed between Nicol prisms. 

From the polariser, a linearly polarised beam impinges on the 
plate. Remove the plate and place the analyser in a crossed posi- 
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tion. Light is nol transmitted. Now put the plate back. The field 
becomes illuminated, i.e., the beam of light is transmitted through 
the system. There can be only one explanation, namely, the crystal 
plate has changed the polarised state of the beam coming from the 
polariser. By means of the analyser, we may determine the exact 
nature of the change. If upon turning the analyser a new position 
for darkness is found, the conclusion is that the crystal plate has 
changed the direction of the beam oscillations but has left them lin- 
early polarised. If the intensity of the light isnot changed by turn- 
ing the analyser, the conclusion is that the plate has transformed 
the linearly polarised light into circularly polarised light. Finally 
if the light is not extinguished by turning the analyser or plate, 
but the intensity of the light is changed thereby, the conclusion 
is that the plate has created elliptically polarised light. 

The changes produced in linearly polarised light depend on two 
things—the mutual orientation of the plate’s optic axis and the 
oscillation direction of the beam coming from the polariser, and the 
phase difference 6 = + (nr — Ne) created by the plate between 
the ordinary and extraordinary waves into which the incident wave 
is divided. 

If the substance placed between the crossed Nicol prisms is iso- 
tropic, no illumination of the field occurs. The phenomenon de- 
scribed above may be utilised in the investigation of anisotropic 
substances. 

Usually, observations are made using crossed Nicol prisms be- 
tween which the plate is rotated. During such rotation, the illumi- 
nation does not remain constant. At every instant, the amplitude A 
of the light emerging from the polariser is resolved into the compo- 
nents A cos @ and A sin g, where @ is the angle between the polariser 
“slit” and the optic axis of the plate. The terminus of the electric 
vector of the wave emerging from the plate describes an ellipse: 

Bey 
Arcos? ' A®sin?@ A%cospsine 
where 6 is fixed and @ changes continuously. Fig. 177 shows ellipse 
transformations for the case of a quarter-wave plate, i.e., in tlfe 
above formula 6 = 90°. For different values of ~, different polar- 
ised states are obtained. 

Since the path difference 6 depends on the wavelength, the pat- 
tern is coloured when white light is used. If the plate is of uniform 
thickness, it will have a single colour that differs for every differ- 
ent relative orientation of the plate and Nicol prisms. Thus, for 
certain wavelengths of white light, the plate thickness may be 


cos 6 = sin? 6, 


equal to x , for others it may be equal to x , and for still others a mul- 
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tiple of A. Accordingly, for different wavelengths there arise differ- 
ent polarised states, which are transmitted by the analyser to 
a greater or lesser degree for different relative orientations of the 
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plate and Nicol prisms. The phenomenon of chromatic polarisation 
is very beautiful. It is hardly likely that the richness of shades and 
hues observed when the thickness of a plate changes (e.g., in crys- 
tal growth), or when the orientation 
of a plate is varied relative to Nicol 
prisms, may be achieved by any other 
means. 

If a plate is of variable thickness, 
the interference fringes are rainbowed 
when observed in white light. 

In addition to these palternsof fringes 
representing equal thickness,  dis- 
tinctive fringes representing equal 
inclination may be observed if the 
crystal plate is observed using con- 
verging rays (iconometric investiga- 
tion). These observations may be made 
on small crystalline granules in the 
field of vision of a microscope. Their Fig. 178 
practical significance lies in the de- 
termination of crystal symmetry. In particular, it is not difficult 
to determine to which of the following three groups an object belongs: 
1) amorphous or crystalline substances having cubic symmetry; 
2) uniaxial crystals; and 3) crystals having symmetry of lower or- 


408 Double Refraction 


the relaxation time, i.e., the time required for the molecules to 
assume the appropriate orientation in the electric field, is of the 
order of one-thousand millionth of a second. Therefore, electric 
oscillations modulated by sound may be transformed into light 
intensity variations. This makes it possible for sound to be recorded 
on photographic film. 


152. Optical Activity 


The ability of certain substances to change the oscillation direc- 
tion of a linearly polarised beam is known as optical activity. The 
phenomenon may be described as follows. Consider an arrangement 
of crossed Nicol prisms with an optically active substance placed 
in the path of a beam. The field becomes illuminated, but the 
illumination disappears when the analyser is turned by some angle a. 
Thus, linearly polarised light transmitted through an optically. 
active substance remains linearly polarised, but the oscillation 
direction of the beam changes by the angle a. Experiment shows 
that the change in oscillation direction is strictly proportional 


to the thickness of the layer of substance: 
J 


a=pod. 

The constant p characterising the substance is known as the specific 
rotation constant and is usually expressed in degrees per millime- 
tre. The phenomenon exhibits dispersion since p is a function of 
the wavelength. Normally, p decreases with increasing wavelength. 
_ The change in oscillation direction is quite considerable and 
in the case of many substances attains a value considerably greater 
than ten degrees per mm for a number of wavelengths. For water 
solutions of organic substances, the rotation of the polarisation 
plane is a function of the concentration, i.e., @ = ped, where c is 
the concentration. 

What kind of substances are optically active? An optically active 
substance must be composed of structural units which have neither 
a plane of symmetry nora centre of symmetry among their elements 
of symmetry. In the case of molecular substances, such components 
are, as a rule, molecules. In the case of crystals, in which mole- 
cules are not distinguishable, such components are unit cells. 

Molecules, or cells, satisfying the above conditions, may be en- 
countered in the form of two optical isomers, designated by the letters d 
and J (dextro and levo, or right and left). An object and its image 
in a mirror are optically isomeric. A substance consisting of 
d-molecules (or cells) rotates light to the right, while one consist- 
ing of l-molecules rotates light to the left. By rotation to the right . 
we mean the case in which the analyser must be turned to the right 
(from the viewpoint of an observer facing the oncoming beam of 
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light) to restore darkness when the thickness of the layer of sub- 
stance is increased. Reversing the direction of the light does not 
change the sign of the effect. 

Optical activity may occur for substances in the liquid as well 
as in the solid state. It is merely necessary that there -be a surplus 
of d- or I-molecules. The orientation of these molecules may be either 
random or uniform. In the first case, the body is isotropic and 
the rotation is the same irrespective of the direction of the beam of 
light. In optically active crystals, the : 
magnitude of the rotation a is a func- 
tion of the beam direction relative to 
the crystal axes. 

When molecular crystals that rotate 
light are fused, the structural compo- 
nents remain intact. In such cases, the 
solid as well as liquid substance pos- 
sesses optical activity. An example 
of this is sugar, which, in addition, 
possesses activity in solution. This Fig. 180 
property of sugar is utilised in the 
saccharimeter to determine the amount of sugar in a solution from the 
magnitude of the change in the oscillation direction of a beam of light. 

The situation is different in such crystals as quartz (see Fig. 180). 

The arrangement of atoms in a quartz cell satisfies the necessary 
conditions, namely, it does not have a centre of symmetry nor 
a plane of symmetry. The molecules are not distinguishable in 
a quartz crystal; as a result, the arrangement of atoms changes upon 
fusion. Hence, in fused quartz the necessary structural units are 
not present. Fused quartz is, therefore, optically inactive. 
“ One and the same substance, from the point of view of chemical 
composition, may be encountered in the optically active form and 
in the optically inactive form. This applies not only to quartz. 
The structure of an inactive variety bears little resemblance to the 
structure of crystals possessing optical activity. 

The above is quite understandable in the case of ionic and homo- 
polar crystals. But how can an inactive crystal be formed from 
active molecules in the case of a molecular crystal? This occurs by 
the formation of racemic crystals. A racemic mixture is a mixture 
of equal quantities of d- and J-molecules. Such a mixture does not 
rotate light since the two opposite effects are equalised. A racemic 
crystal consists of pairs of d- and l-molecules. Every pair consti- 
tutes a centrosymmetric group of atoms. 

Optically active crystals exist in both the d- and l-form. The 
structures of such crystals are identical in the same sense that right 
and left gloves are identical. In the case of molecular crystals, this 
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means that in one case the structure is composed of d-molecules and 
in the other of /-molecules. Dextro- and levo-quartz, dextro- and 
levo-glucose, dextro- and levo-tartaric acid—all the properties of 
these substances, all the details of their structure, are the same 
except for the fact that light is rotated in different directions. 

Inorganic optical isomers (e.g., dextro- and levo-quartz) are 
-encountered in nature in equal quantities. This is not the case with 
organic molecules, which are important in biology. The French 
chemist Pasteur showed that a number of microorganisms are capa- 
ble of feeding on only a specific optical isomer. 
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How is the phenomenon of optical activity explained? Before 
answering this question, we shall show that a linearly polarised 


beam is equivalent to two circularly polarised beams that are dextro- 
and levo-rotated. 


Let us write the equations for the electric vector oscillations, 
taking into account that there is a phase difference & between these 
circularly polarised waves. For the dextro-rotated wave 


E% = Eycosat and E4 = Ep sin wi; 
for the levo-rotated wave 
E', = E, cos (wt +6) and Ey = — Ey sin (wt +6). 
The total field has the components 


E,=E:+E and E,y=Et+ £E', 

To determine the polarised state of the resulting oscillation, let 
us calculate the ratio ab for the total field. Using simple trigonomet- 
ric transformations, we obtain 

Ey 


6) 
‘Eee —tan>. 


The ratio is independent of time. It is seen that we are dealing with 
linearly polarised oscillations that form an angle oe with the 
z-axis. Q.E.D. 


On the basis of this conception, the phenomenon of rotation of 
oscillation direction is quite easily understood. Rotation of the 


oscillation plane by an angle = signifies that the levo-rotated wave 


is lagging behind the dextro-rotated wave (or vice versa, depending 
on the rotation direction) by the angle 6. In view of this expla- 
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nation, it is clear why a discussion of optical activity has been includ- 
ed in the chapter on double refraction. Here, too, the wave is 
divided by the substance into two components, one moving faster 
than the other and continuously advancing relative to it in phase. 
From this viewpoint, the specific rotation is proportional to the 
difference in refractive index between the dextro- and levo-rotated 
beams. 

This discussion has in no way advanced our understanding of 
the phenomenon. We have merely given it another (completely equiv- 
alent) interpretation. However, this new approach enables us to 
more easily explain optical activity. Circularly polarised waves 
that are dextro- and levo-rotated travel through a substance with 
different velocities. They encounter different indexes of refraction 
and, hence, different permittivities and polarisabilities. The dis- 
placements of the electron cloud under the action of these two waves 
must differ. One wave experiences more difficulty than the other 
in displacing electrons from their equilibrium positions. If we can 
determine the reason for this difference, the explanation for optical 
activity will have been found. 

We know from chemistry that if a molecule has an asymmetrical 
carbon atom, the substance may exhibit optical activity. By an 
asymmetrical carbon atom chemists mean a carbon atom bound to 
each of four different atoms or radicals. 

The angles formed between the bonds of a tetravalent_ carbon 
atom are approximately equal to those in a tetrahedron. Fig. 181 
shows a molecule containing an asymmetrical carbon atom. The 
radicals or atoms bound to C differ, but their nature is unimportant. 
We see, in the first place, that two different molecules of such a sub- 
stance, which are mirror images of each other, are possible. These 
are optical isomers and cannot be made to coincide. This may be 
easily demonstrated using wire models. 

Consider a circularly polarised wave travelling along the axis 
of symmetry of the bonds. In Fig. 182 the wave is directed outward 
from the page. Atoms A and B are higher than atoms D and E. Let 
us determine the directions of the electron displacements for dextro- 
and levo-rotated waves. Assume the situation is as follows for the 
dextro-rotated wave: When the vector # is directed along ED, in 
the upper “level” it is directed along BA. If that is the case, the 
situation is as follows for the levo-rotated wave: When the vector 
E is directed along ED, in the upper “level” it is directed along AB. 

[Examining the figures, we see that the displaced electrons behave 
differently. In the first case, the electrons of atoms A and D move 
simultaneously away from the centre. In the second case, when 
the electrons of A move toward the centre, the electrons of D move 
away from the centre. Such differences will always be found for 
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154. Basic Theory 


The theory of relativity, developed at the beginning of this 
century by the great modern physicist Albert Einstein, is based on 
two postulates: 1) the principle of relativity and 2) the principle 
of constancy of the velocity of light. We shall briefly consider the 
essence of these principles, describe the experiments confirming 
them, and discuss certain consequences of the theory. 

The theory of relativity originated with the questioning of the 
existence of a mechanical carrier (ether) for an electromagnetic field. 
The theory of relativity solved this problem and in this sense may 
be viewed as the perfection of electromagnetic field theory. While 
solving the problems posed by electrodynamics, the theory of rel- 
ativity went much further. Its development led to the establish- 
ment of the laws of mechanical motion at velocities close to that 
of light, to the law of the equivalence of mass and energy, and to 
new views on the nature of gravity. Since our discussion will be, 
of necessity, very brief, we are forced to dispense with an histori‘ 
cal narration. 

First, as to the essence of the main postulates. The principle 
of relativity states that all laws of nature (and not only the laws 
of mechanics) are the same in all inertial systems of coordinates. 
The principle states that not a single physical experiment could 
discover special properties for one of the inertial systems. All iner- 
tial systems are equivalent. 

The, second postulate pertains to the constancy of the velocity 
of light in a vacuum for all inertial systems. From this it follows 
that the velocity of light in the “receding” and “approaching” direc- 
tions must be the same, i.e., that the velocity of light is independent 
of the light source and measuring instruments. 

How do these principles affect our views concerning an electro- 
magnetic field and its carrier? It is not difficult to see from the for- 
mulations of the principles that electromagnetic waves and, say, 
sound waves, are not analogous. 

Imagine a laboratory isolated from the external world, moving 
rectilinearly and uniformly relative to the stars. In this laboratory, 
measurements are made of the velocity of sound in the direction of 
motion. Theoretically, two extreme cases are possible: in one, the 
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walls of the laboratory are impervious to air, so that the air is car- 
ried along by the laboratory; in the other, the walls are pervious 
to air, the air is stationary relative to the stars, and the laboratory 
moves through the air, i.e., without carrying it along. Assume in 
these two cases that measurements are made of the velocity of sound. 
The velocities are measured by two observers—one moving and the 
other stationary relative to the stars. In each case, the velocities of 
sound relative to these two observers will differ. If the velocity of 
sound in air is designated by c and the velocity of the laboratory 
relative to the stationary observer by v, then, in the case in which 
the air is carried along, the moving observer finds the velocity equal 
to c and the stationary observer finds it equal to c + v; in the case 
in which the air is not carried along, the moving observer finds the 
velocity equal toc — v and the stationary observer finds it equal to c. 

The postulates of the theory of relativity reject both variants 
in the case of an electromagnetic wave in an ether. In experiments 
with light waves, the velocity of light will be equal to c for the sta- 
tionary as well as the moving observer. This means that a station- 
ary as well as a moving ether is incompatible with the theory of 
relativity. Thus, the theory of relativity rejects the possibility of 
viewing the field as a medium in which mechanical displacements 
occur. We must conclude that electric and magnetic fields have 


a real existence. 


155. Experimental Verification of the Principle 
: of Constancy of the Velocity of Light 


At first glance, the principle of constancy of the velocity of light 
seems to fly in the face of “common sense”. Therefore, before discuss- 
ing certain consequences of the theory of relativity, it is desirable 
to describe the direct experimental evidence for its validity. This 
evidence is derived from astronomical observations. 

Astronomers have discovered the existence of so-called double 
stars. A double star consists of two heavenly bodies of approximately 
“the same mass rotating about their overall centre of gravity. We 
have the means to measure the distance between the stars, their 
mass and their velocity; also, to determine their relative motion. 
If the velocity of light depended on the velocity of the star itself, 
the velocity of the heavenly body would be added to the velocity 
of light when this body moved toward a terrestrial observer and 
subtracted when this body moved away from the terrestrial observ- 
er. In such a case, to the terrestrial observer, the motion during 
one half of the orbit would appear faster than during the other half. 
This effect would be detectible even if the velocity v of the heavenly 
body were one-hundred thousandth of the velocity of light c. 
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Thus, for a long distance J, the difference in times 7aER 


may be so considerable, even for very small values of v, thal not 
only is the periodicity disturbed, but a light beam transmitted 
during motion in the “receding” direction may overtake a beam 
transmitted during motion in the “approaching” direction. Then, rota- 
tion of the star would not be visible or would be of a peculiar nature. 
The periodic rotation of double stars may be understood only on 
the basis of the principle of constancy of the velocity of light. 

To be sure, our discussion has dealt with the motion of a light 
source, so that there may remain some doubt regarding the validity 
of the principle of constancy of velocity for the motion of an observ- 
er. Such doubt may be removed by another astronomical observa- 
tion, i.e., observation of the periodicity of the motion of Jupiter’s 
satellites. Measurements of the motion of Jupiter’s satellites may be 
made in two cases—when the light arriving on the Earth from J upi- 
ter coincides with the direction of motion ‘of the solar system and 
when it is in the opposite direction. The identicalness of the obser- 
vations and the distinct periodicity of Jupiter’s annual motion 
demonstrate the validity of the principle of the constancy of the 
velocity of light in this case as well. 

The most important role in the development of the theory of 
relativity was played by an experiment first performed by Michel- 
son in 1881 with the aid of the interferometer described on page 360. 
This experiment consisted in the following. The locations of two 
mirrors, i.e., the arm lengths J, and Jy, were selected in such a man- 
ner that the coherent beams into which a light signal is divided 
would require the same amount of time to cover the distances along 
the two arms of the interferometer. This selection is made when 
the interferometer is arranged in such a manner that one of the 
arms is parallel to the motion of the globe in its orbit. The instru- 
ment is then’ turned 90° and the interference fringes observed for 
possible displacement. 

The results of the Michelson experiment, which was repeated 
many times by Michelson and other investigators, are the following: 
no displacements of the fringes occur and the times required for 
light to cover the distances along the arms remain equal when the 
instrument is turned 90°. This conclusion is based on very accurate 
measurements. 

What is the significance of this experiment? Since the Earth 
moves with a velocity v ~ 30 km/sec relative to fixed stars, the dis- 
tances covered by the two rays cannot be the same from the view- 
point of a celestial inertial observer. 

Let us examine the paths of the two rays (see Fig. 183). Of course, 
we need only concern ourselves with the portions of the path along 


l 
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which the beams travel separately. The longitudinal beam in the 
“receding” direction must cover the distance of the arm length J, 
and overtake the mirror moving with a velocity vin the same direc- 


7 


Fig. 183 Fig. 184 


tion. Therefore, the distance ct, covered by the beam must equal J, + 
+ vt,. The time required for the wave front to reach the mirror is 


eee eae 
oe 
c (1-=) 
In the “approaching” direction, the beam covers the distance of 
the-arm length J; minus the distance covered by the approaching 


instrument. Therefore, the distance ct, covered by the beam must 
equal 1, — vt. Then, 


y= 


ly 
™%=——*_ . 
The time t; + t, measured in the experiment is equal to 
21, 


p2\° 
Now, let us direct our attention to the transverse beam (see 
Fig. 184). During the total time t, i.e., the time elapsed from the 
instant the beam leaves the centre of the instruments to the instant 
it returns, the mirror is displaced as shown in the figure. Therefore, 
27-1409 
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the distance covered by the wave is 


ot 2 B+ (3)*; 


whence, the time t is equal to 


In the first measurement, the arms J, and l, were selected in such 
a manner that the times required for the beams to cover the separate 
paths were equal. Hence, 


21, 2ls ly 


S00 = or — =] 

v2 v2 5 v2 2e 
e(- a) ef1-8 Vi-4 
However, in the second experiment, i.e., when the interferometer 
is turned 90°, there is no interference fringe displacement and the 
times remain equal even though the arms J, and l, have interchanged 


places! This is the surprising result of this experiment. 
Thus, if the first arm is longitudinal, 


ya 
Lehi 1-2, 


if the second arm is longitudinal, 


2 
I, => I, 1 — <> . 
Forv = 0, = 1,; but for v 0, we obtain a remarkable result: the 
length of one and the same segment differs, depending on whether 
this segment is parallel to the direction of motion or perpendicular 
to it. The obtained result is valid for any body and for any distance 
between two points. Thus, the first consequence of the theory of 
relativity is that a body moving relative to an inertial observer 
shortens its dimension in the direction of motion. The transverse 
dimensions remain unchanged. If an observer relative to whom an 
object is stationary finds that the length of this object is J), an observ- 
er relative to whom this object is moving with velocity v will find 


that its length is 
v2 
L=Iy V 1— =... 


Example. If an object moves with a velocity of 1,000 km/sec relative to 
some “stationary” observer, the length of the object in the direction of motion 
appears to be 1 divided by 1.000005. If the velocity of the object is 


200,000 km/sec, then 70 = 1.34, 
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The length of one and the same segment moving in a specific man- 
ner in different frames of reference, will differ. It is necessary to under- 
stand properly the relative nature of the above contraction. Take 
two rods of the same length J) and assume that their relative velocity 
is v. Now, assume that there are two observers—one moving with the 
first rod and the other with the second. In'such a case, the first observ- 
er will find that his rod has a length J) and the other a length 


lo V 1 2 . For this observer, the second rod will be shorter than the 
first. On the other hand, the second observer finds that the second 


7 2 
rod has a length Jp and the first a length lV 1— “5 . For him, the 


first rod is shorter than the second. 

The length of a rod (or, in general, the distance between two 
points) is a relative concept. Of all rod lengths measured in various 
inertial systems, the rest length J) is outstanding. This maximum 
length of a rod has absolute meaning. 
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In the expression relating the length of a rod at rest to the length 
of a rod in motion, the factor Y 1 — B? appears, where B = = . This 


factor also appears in analogous formulas relating the values of 
various physical quantities for stationary and moving observers. 
Using an approach similar to that taken in the preceding article 
leads to interesting results as regards time and acceleration, mass 
and force, momentum and energy, density of charge and current, 
field intensities, etc. The formulas of the theory of relativity enable us 
to convert values determined by a stationary observer to values 


. : . v . . 
determined by a moving observer. The ratio B = > is in all cases 


an important criterion of the need for a relativistic correction. 

It is easily seen that B* is comparable to unity only when the 
velocity is very large. Even when v = 100,000 km/sec, V1 — ? is 
only several per cent less than unity. It is, therefore, clear that the 
theory of relativity yields negligible corrections when the motion 
is slow, i.e., in such cases it is not necessary to take into account 
the changes in physical properties with motion. The theory of rel- 
ativity is of particular importance for the microworld, where par- 
ticles having velocities approaching the velocity of light are en- 
countered quite often. 

Let us direct our attention to the consequences of the theory as 
regards time. It turns out that the interval t during which an event 
occurs is also not the same from the viewpoint of two different iner- 


27* 
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tial systems. Thus, two events occurring simultaneously trom one 
viewpoint occur at different times—one earlier and the other later— 
from the viewpoint of another frame of reference. 

Qualitatively, this assertion follows immediately from the prin- 
ciple of the constancy of the velocity of light. Thus, consider a sys- 
tem moving uniformly and rectilinearly relative to another inertial 
system. In one of them, there is located a radiator from which light 
is radiated in all directions. In this system, let us select two points 
equidistant from the source of light along a straight line in the 
direction of relative motion. It is clear that in this system the light 
arrives at both points simultaneously. This is the situation from the 
viewpoint of an observer moving together with the source. However, 
to an observer in the other system the situation appears to be differ- 
ent. To this observer, one point is moving toward the signal and 
the other away. Since the velocity c has the same value for this observ- 
er too (moreover, the same in both directions), from his viewpoint 
the light arrives earlier at the point that is behind. 

A doubt may arise: cannot such a conclusion lead to absurdities? 
One may reason that since the concept of simultaneity is relative, 
it may happen that from the viewpoint of one frame of reference 
a gun is fired and then a wounded bird falls from a tree, but from the 
viewpoint of another frame of reference the bird falls before the 
gun is fired. Careful analysis shows that the relativity of a sequence 
of events is limited by the velocity of propagation of interaction 
(less than c). Therefore, “earlier” and “later” may interchange places 
only when they are not causally related, i.e., when they are not the 
result of interaction. 

A very interesting result of the theory relates to the proper time 
of an object, i.e., the time determined by a clock moving together 
with a given body. If a time t has elapsed according to the clock 
of an observer in a certain inertial system, the handle of the clock 
moving with the object will have advanced by the time 


t=tV1—B 
This means that a clock moving in any arbitrary manner moves 
slower than a stationary clock.* 

It is necessary to comprehend properly the relative meaning 
of this assertion. If two observers are in different inertial systems, 
each will assert that the clock of the other observer is slow. This 
would seem to be a paradox. Let us stop the observers and compare 


their clocks. However, to perform this check, at least one of the 
observers must perform a complete circuit. Upon returning to the 


* This formula has found experimental confirmation in experiments with 
p-mesons (see p. 557). ; 
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point of departure, the clocks may be compared. But now the deter- 
minations of the observers have lost their relativity. The observer 
remaining in an inertial system is justified in applying the above 
formula. The observer executing the circuit has undergone accelerated 


motion; hence, the formula to) = tV1 — B* cannot be used by him. 
Thus, after the observer executing the circuit returns and the clocks 
are compared, it turns out that his clock is slow. Moreover, he cannot 
“dispute” this result by reference to the principle of relativity, for 
this principle is valid only for inertial observers. 
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If the mass of a body measured in a system of coordinates to 
which it is bound is designated by mp, to an observer relative to 
whom this body moves the mass appears to be 


m= mo 
Vi-Bp2 * 


The quantity 7 is known as the rest mass and the increase in mass 
with increasing velocity is a natural consequence of the fundamen- 
tal principles of the theory. The velocity of light c constitutes 
a limiting velocity for any motion or transfer of interaction. For 

= c, the mass of a body becomes infinite. Of course, the closer 
a body approaches the limiting velocity, the more difficult it is to 
accelerate it. 

The increase in mass with increasing velocity was first detected 
for the electrons of B-rays as early as the beginning of this century. 


Since the electron velocity v and the ratio = may be determined 


independently (see p. 465), and since the electron charge remains 
unchanged, we are able to check the formula for mass. 
Corrections given by the factor / 1 — f? play an important role in 
the construction of accelerators of charged particles. The particle 
velocities attained in modern accelerators are so great that, for 
example, in one of them f reaches a value of 0.9986; thus, the mass 
becomes 60 times heavier than the rest mass. In all experiments 
conducted under terrestrial conditions with macroscopic bodies, 


we can disregard the V1 — PB? correction to the value of the mass. 
Nevertheless, it is desirable to check its validity not only for ele- 
mentary particles. This is possible by means of precise astronomical 
observations. It turns out that the change in mass of the planet 
Mercury during its orbital motion explains the small deviations of 
the orbit from an ellipse. 
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The momentum formula acquires the following form when we 
substitute the expression for the mass of a moving body: 


mov 
Pie 
It should be noted that Newton’s law remains valid if it is writ- 
ten as f= ap . On the other hand, the formula J’ = ma will no longer _ 
be valid in all cases. 
158. Energy 


In Sec. 10, we obtained an expression for the energy of a moving 
body by finding a function that increases as the work expended in 
accelerating the body. Let us repeat these calculations taking into 
account the corrections provided by the theory of relativity. 

The work of displacing a body by a distance dJ is 


d dt 
Fdl=? dl=dp+=vdp, 


where v is the velocity vector. If this work serves to increase the 
energy @ of the body, then 


dé =v dp =vd (mv) =mv dv-+v? dm. 


Since m= Viz re then dm = aa hence, 
; ve : _ modv 
dé — (1+ =@=p5) mv dv, 1.€., dé = (I—p2) . 


Comparing the last expression with the formula for incremental 
mass, we find: d€ = c?dm, i.e., 


& =me?. 


We have dropped the additive integration constant, for when m= 0, 
6 must also be equal to zero. 

Thus, the work done on a body serves to increase the function 
é = mc’, which has, therefore, the significance of energy of the body. 

The fundamental result of this calculation consists in the follow- 
ing: an increase in the mass of a body is accompanied by an increase 
in its energy (and, hence, an expenditure of external energy); 
on the other hand, a decrease in the mass of a body or system is accom- 
panied by a decrease in its energy (and, hence, a transfer of energy 
to its surroundings). There is a direct and universal relationship 
between mass increment and energy increment, for c? is a constant 
quantity. 

But what is the nature of the energy 6? Is it an energy of motion? 
Evidently not. If the body is at rest, € does not equal zero but 
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equals mgc*. Therefore, Um c* is the rest energy of the body, i.e., 
the internal energy of the body, and the-difference mc* — moc* is 
the energy of motion. 

The first part of the last sentence should be viewed as an asser- 
tion that may be verified experimentally. As for the energy of mo- 
tion, mc?— moc?, this will be recognised as the familiar expression 
for the kinetic energy if the following approximation is used: 


1 1 
—S— 1 >= 2 
Vi-pe +76 
‘To this degree of accuracy, 
1 1 mgv2 
2 2 ee ~~ PRT ena «beh | 
mc? — moc? = moc ( Via 1) Moc? X 5 B ee 


Example. The internal energy of a body of mass mp = 1 kg is U = moc* == 
= 9 X 41026 joules = 2.16 X 10!3 keal. This is the equivalent of the quan- 
tily of energy that would he released in the form of heat in the combustion 
of 3 million tons of coal. Even in thermonuclear reactions, only several per 
cent of these tremendous reserves of internal energy are released at present. 
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As was indicated in the preceding article, the expression relating 
rest mass to the internal energy of a body, i.e., U = mac”, may be 
verified experimentally. 

The internal energy of a body consists of the rest energy of the 
component parts, their kinetic energy and-their potential energy 
of interaction. A change in any of these component energies affects 
the value of U and, hence, the rest mass as well. Thus, the rest mass 
increases if the temperature of the body rises, i.e., if the internal 
motion of the system increases. The rest mass also increases if repel- 
ling components of the system approach one another or if attracting 
components move apart. 

It is clear from the above that the rest mass of a system of inter- 
acting particles does not possess the property of additivity, i.e., 
it is not subject to the law of conservation. If a body of rest mass 
M, consists of N particles, each of mass mg, then M7) + Nig. The 
difference 

M)—Nm = AM 


is called the mass defect of the body (or system of particles). The 
quantity 
CAM 


is called the binding energy. 
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If a system breaks up into a number of components, binding 
energy is released and may be measured. Moreover, the rest mass 
may also be directly measured. Thus, the U = moc? law may be 
verified experimentally. 

Numerical examples show that any change in internal energy 
related to a change in the velocity of motion and the interaction 
force between molecules and atoms cannot lead to a measurable 
change in mass. Experimental verification of this theory is possible 
in nuclear physics (see p. 540). 


Examples. 1. The mass of 1 kg of molybdenum increases by AM = 
= 0.000000003 gm when it is heated to 1,000°C. 

2. If a steel rod of 128 cm length and 4 cm? cross-section (mass of the rod = 
= 1 kg) is stretched by a force of 8 tons, the potential energy thereby stored 
in it increases its mass by 2 x 107!” gm. 


160. The Principle of Equivalence and the General Theory 
of Relativity 


Let us consider a noninertial system of coordinates moving with 
an acceleration @j. Assume that we wish to describe physical phe- 
nomena in this system. The laws of mechanics in this system will 
appear different than in an inertial system, for # = ma is valid only 
for the latter. A stationary body will have an acceleration —Ao 
relative to this system. 

If we maintain the terminology used for an inertial system and 
assume that acceleration is produced by forces, then the “force” 
field—may acting on all bodies in an accelerated system may be 
called an acceleration field and an analogy may be drawn between 
this field and a gravitational field. ; 

In exactly the same manner, we may introduce additional “force” 
fields in considering phenomena in a rotating system of coordi- 
nates and, of course, in the general case. The fictitious force fields 
that we have introduced for the description of motion from the 
viewpoint of a noninertial system of coordinates may be called fields 
of inertial forces. The force—may is an inertial force. 

The motion of a point having an acceleration @ relative to such 
a noninertial system will obey the equation 


ma = I+ inertial forces. 

Expressions for inertial forces may be found in textbooks on 
theoretical physics. 

It is important to direct our attention to the theoretical side of 
this question. In noninertial systems, fictitious force fields appear. 
To each such acceleration field there corresponds a fictitious distri- 
bution of attracting mass. Hence, any field created by accelerated 
motion may be interpreted, generally speaking, as a gravitational 
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field. In this sense, we sometimes speak of the equivalence of gravi- 
* tation and acceleration. 

Let us consider several simple examples. Assume that we are in 
an elevator falling with an acceleration a. Let us drop a ball and 
examine the nature of its fall. As soon as the ball is dropped it be- 
gins, from the viewpoint of an inertial observer, to fall freely with 
acceleration g. Since the elevator is falling with acceleration a, the 
acceleration relative to the elevator floor is g — a. An observer in 


the elevator can describe the 


motion of the falling body by a. 

Means of the acceleration 

g’ = g —a. In other words, 

the observer in the elevator 

need not speak of the acceler- fen 

ated motion of the elevator TL 

since he has “changed” the ae ea, 

acceleration of the gravitation- ale 

al field in his system. | Gy? 
Now, let us compare two Wii 

elevators. One is suspended wt! 

over the Earth and the other V 


moves in interplanetary space 
with an acceleration @ rela- 


tive to the stars. All bodies in 
the elevator suspended over the Earth are able to fall freely with 


acceleration g. But bodies inside the interplanetary elevator have 
a similar capability. They “fall” with an acceleration—a te the 
“bottom” of the elevator. The role of bottom is played by the wall 
opposite to the acceleration direction. 

Thus, the action of a gravitational field and the manifestation of 
accelerated motion are indistinguishable. 

The behaviour of a body in an accelerated system of coordinates 
is the same as the behaviour of a body in the presence of an equiva- 
lent gravitational field. However, this equivalence is complete only 
if we limit ourselves to observations over small portions of space. 
Thus, imagine an “elevator” having linear floor dimensions of several 
thousand kilometres. If such an elevator is suspended over the 
Earth, the phenomena occurring in it will differ from those occurring 
in an elevator moving with an acceleration g relative to fixed stars. 
This is clear from Fig. 185. In one case bodies fall obliquely to the 
bottom of the elevator, while in the other case perpendicularly. 

Thus, the principle of equivalence is valid for such volumes of 
space in which the field may be considered uniform. 

The above qualitative considerations lie at the basis of the gen- 
eral theory of relativity. This theory was also developed by Ein- 


Fig. 185 
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stein. In it, he sought formulations for the laws of nature, independ- 
ent of the choice of coordinate system. Until now,-we have assumed 
that this was possible only for inertial systems of coordinates. The 
principle of equivalence shows that the absoluteness of acceleration 
may be destroyed by a gravitational field. An accelerated system 
of coordinates may be viewed as an inertial system if we introduce 
an equivalent gravitational field. To be sure, as we have just seen, 
such equivalence is limited in time and space. However, Einstein 
showed that this restriction may be removed if, corresponding to 
the introduction of the gravitational field, a change in the geometry 
of the system is introduced. 


CHAPTER XXV 
THE QUANTUM NATURE OF A FIELD 


161. Photons 


On a number of occasions we have indicated that radiation and 
absorption of electrical energy occur in packets, or quanta. The 
magnitude of a quantum depends only on its radiation frequency and 
is equal to hv, where h is a universal constant equal to 6.62 x 107°? 
erg sec. It should be noted that the quantum nature of radiation 
and absorption has been established already for the entire electromag- 
netic spectrum, i.e., from hard y-rays to long radio waves. 

The phenomena of radiation and absorption characterise, in the 
first place, the microsystem interacting with the electromagnetic 
field of a wave. The quantum nature of these phenomena (which 
we shall discuss in detail in Part III) shows that a microsystem has 
distinct energy levels and that the values of these energy levels 
cannot be arbitrary. These facts by themselves would not have led 
to the conclusion that this quantum nature is characteristic of an 
electromagnetic field as well as of matter if an electromagnetic wave 
in its interaction with matter did not behave, in a number of cases, 
as a particle. The corpuscular properties of electromagnetic radiation 
are manifested when losses and transformations of electromag- 
netic energy occur. The shorter the wavelength, the more distinct 
the effects. These properties, on theother hand, are not manifested 
during propagation, scattering and diffraction of electromagnetic 
waves if these processes are not accompanied by energy losses. 

A corpuscle of an electromagnetic field is called a photon. It 
is characterised, in the first place, by the magnitude of its energy: 


G=hy. 
Using the law of equivalence of mass and energy, we are entitled 


to ascribe to a photon the mass 


é hv 
We nn 


Since an electromagnetic field is propagated with a velocity c, it 
mo 


Vi-R 


must be concluded from the formula m = that the rest mass 


of a photon is equal to zero. 
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Assuming the concept of momentum is applicable to a photon, 


we obtain 


ee 
p=me=—. 


It should be recalled that the Lebedev experiments (see p. 321) 
directly demonstrate the validity for light of the formula p = y , 


the relationship between the momentum density and energy density 
of an electromagnetic wave. The formula for photon momentum is 
in complete agreement with this result. 


Values of 2, m and p for Photons of Various Types 
of Electromagnetic Radiation 


| au . 2 m(gm) | p(gmem/sec) 

Radio waves. 2,000 10-21 erg =0.62 10-9 ev |1.1 10742! 3.3><10-32 
metres 

Visible light 6,000 A | 3.3x10-12 erg=2 ev (3.6 10-33] 1.4 10-22 

"X-rays ... 1A 19.8 10-® erg = 12,400 ev}2.2x10-29] 6.6><10-19 


As may be seen from what follows numerous experiments convinc- 
ingly show that photons exist. On the other hand, a mass of exper- 
imental evidence prevents us from abandoning our view of an elec- 
tromagnetic field as a continuum. The sharpest contradictions arise 
in considering interference phenomena. These phenomena are ele- 
gantly explained by the wave nature of the field, but are completely 
inexplicable from the corpuscular viewpoint. 

Thus, consider a simple interference arrangement—two close 
apertures through which light may be transmitted. The following 
experiment is easily performed. First, let us photograph the trans- 
mitted light when both apertures are open. We obtain the pattern 
discussed earlier, namely, alternate bright and dark fringes. Now, 
let us close each of the apertures in succession and take the photo- 
graph on one plate. The result, ‘of course (from the viewpoint of wave 
theory), will be different, i.e., there will be no interference. 

Let us now consider how this experiment may be interpreted in 
corpuscular terms. It is conceivable (by stretching one’s imagina- 
tion) that photons fall unequally on different parts of the photograph- 
ic plate owing to rebound from the edge of the aperture or collision 
with one another. But the patterns obtained differ depending on 
whether the light passes through both apertures simultaneously or 
consecutively. Photons passing through one aperture “know” wheth- 
er the other aperture is open or closed. 
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This experiment and many others show that it is quite impossible 
to reduce electromagnetic phenomena to only a field pattern or to 
only a system of photons. Each concept is exceedingly fruitful in the 
case of one group of phenomena, but fails in the case of the other. 

During the last few decades, physicists have energetically sought 
ways of reconciling these two contradictory views of electromagnet- 
ic radiation. A field is a reality characterised by continuous values 
of field intensity in space and time; a corpuscle is a reality occupying 
a certain limited region of space at a given instant. These contra- 
dictory qualities are combined in electromagnetic radiation. In 
Chapter XXVII, we shall see that these contradictory properties 
are combined not only in the case of electromagnetic radiation, but 
in the case of matter as well. However, physics has made consider- 
ably more progress in understanding matter than in understanding 
an electromagnetic field. The dual nature of particles of matter is 
described by the Schrédinger equation (see p. 474); interactions 
between corpuscles and waves for such particles are understood 
quite well. 

Unfortunately, the situation is much worse as regards electro- 
magnetic field (radiation) theory, commonly referred to as quantum 
electrodynamics (for a detailed discussion, see p. 506). Such a com- 
plete theory does not exist. In view of the fundamental contradic- 
tions existing in quantum electrodynamics, its partial successes, 
expressed in the establishment of new relationships between field 
and particles, cannot be generalised. Hence, the interrelation be- 
iween photons and electromagnetic field remains unclear. 

The rules of “translation” from corpuscular terminology to wave 
terminology, and vice versa, are based on the following: An electro- 
magnetic wave of length A and intensity J may appear as a stream 
c 
he 
is the number of photons passing per unit time through unit area. 
The direction of motion of the wave front is the direction of motion 
of the photon. 

We shall not discuss in corpuscular terms the very complex 
problem of the polarised state of light. To do this, it is necessary 
to assume that a photon has a selected direction, or spin (see p. 496 
regarding electron spin). 


of photons of frequency v = — and intensity J = Nhv, where N 


162. Photoelectric Effect 


The escape of electrons under the action of electromagnetic waves 
constitutes important confirmation of the indispensability of the 
corpuscular viewpoint. This phenomenon will be considered here 
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from this viewpoint, and again in Sec. 274 when we discuss the 
action of light on metals and semiconductors. 

Since the escape energy of an electron from a metal (sce p. 694) 
is not less than 2.2 ev, the photoelectric effect becomes possible 
when hv > 3.5 X 10°” erg, i.e., for frequencies of the order of 
0.5 x 10° cycles (A = 6,000 A). 

Einstein proposed that the photoelectric effect be viewed as an 
effect of collision between a photon and an electron. In this process, 
the photon gives up all of its energy and ceases to exist. If A rep- 
resents the work function of the electron, i.e., the work required 
to overcome the binding force between the electron and the sub- 
stance, the law of conservation of energy has the form 


mv2 


hv=A+ 5? 


ai ab ns 
where > is the kinetic energy of the photoelectron, the electron 


dislodged from the substance. 

The first means of checking the validity of the photon hypo- 
thesis consists in verifying the linear dependence between photoelec- 
tron kinetic energy and frequency of incident radiation. 

The photoelectron energy is determined by the bias potential 
method. If the surface of the substance from which the electrons 
are dislodged constitutes a condenser plate, current flows through 
the circuit in which this condenser is connected. Current ceases to 
flow when an appropriate bias voltage is applied to the condenser. 
This condition is given by 

mv2 
eu, = 3° 
{[t should be realised that the greater the depth from which the 
electrons are dislodged, the smaller the velocities. Therefore, cur- 
rent ceases to flow when the electrons closest to the surface are pre- 
vented from escaping. By experimentally determining eU, for var- 
ious frequencies v of electromagnetic radiation, curves of Uy vs. 
v may be plotted. The ideal straight lines obtained are shown in 


Fig. 186. The slope a of the straight line eV, = hv — A may be 


calculated from other data, providing another independent means 
of checking the validity of the theory. 

Nevertheless, the above experiment cannot be considered direct 
proof of the photon hypothesis. The possible objection is that the 
photoelectron may gradually accumulate the energy transmitted 
to it by an electromagnetic wave. This objection was answered by 
the classical experiment of A. F. Yoffe and N. I. Dobronravov. 
Earlier, Yoffe had investigated the photoeffect by means of a par- 
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ticle of dust suspended between the plates of a condenser. Owing to 
inevitable air friction, the particle of dust carries a charge and, 
hence, its weight may be counterbalanced by an electric field. For 
equilibrium g# = mg, where m is the mass and q is the charge of the 
particle of dust. In the photoeffect process, the particle of dust loses 
an electron and, hence, depending on the sign of g, changes its charge 
by q +e org —e. The particle of dust is then no longer in equilib- 
rium and begins to move towards one of the condenser plates. To 
counterbalance the dust 
particle, it is necessary 
to change the field. The 
equilibrium condition is 
now 
(qe) E=meg. 


UY, , volts 


In this manner, Yolfe 
determined the charge of 
an electron. 

Now, let us describe 
the Yoffe and Aad 
ravovy experiment. Here, si, 'A 
too, the behaviour of a : Hise tee 
dust particle suspended 
between the two plates of a condenser was observed, but now the 
goal was different. The anode of an X-ray tube served as one of 
- the condenser plates. A voltage of 12,000 volts was applied to the 
tube and the X-rays were created by an exceedingly weak electron 
stream of about 1,000 electrons per second. 

As is well known, X-rays are created when an electron strikes 
an anode. But what is radiated by the anode? Is it a continuous 
electromagnetic field or 1,000 photons per second? The dust particle 
between the condenser plates enables us to obtain the answer. The 
X-rays dislodge electrons from the dust particle. But how do they 
do this? 

The Yoffe and Dobronravov experiment showed that, on the 
average, one electron was dislodged from the dust particle every 
30 minutes. If the X-rays were propagated in the form of a contin- 
uous field, then at each instant the dust particle would have ob- 
tained a very minute amount of energy, insufficient of course to 
dislodge an electron. This energy would have been evenly distribut- 
ed among all the electrons of the dust particle. From the wave 
viewpoint, a quite inconceivable conclusion would have to be drawn 
from the Yoffe and Dobronravov observations, namely, that once 
every 30 minutes all the electrons transfer energy to one electron, 
which then escapes from the dust particle. 
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The photon hypothesis not only explains the phenomenon qual- 
itatively but quantitatively as well. The dust particle in the above 
experiment consisted of a bismuth spherule having a radius of 3 X 
x 10-* cm. It was located at a distance of 0.02 cm from the anode, 
from which X-rays emerged in all directions. The probability of 

ce : . (3X 10-5)2 1 
a photon striking the dust particle is Fa W0D = T)800,000 ° 
Since in 1 second 1,000 photons are dislodged, on the average 1 pho- 
ton will strike the dust particle every 1,800 sec (30 minutes), which 
agrees with the experimental result. 


163. Fluctuations in Luminous Flux 


The experiments of S. I. Vavilov devoted to the study of fluc- 
tuations in luminous flux of low intensity provide important exper- 
imental corroboration of the photon theory. 

It turns out that the eye’s threshold of sensitivity to light is 
exceedingly low. The human eye is capable of perceiving approxi- 
mately 100 photons per second falling on the cornea. If the luminous 
flux fluctuates about this value, light will not be perceived by the 
eye when the number of photons drops below the threshold value. 

In the Vavilov experiments, the investigator observed a beam 
of light that was discharged every second for a time interval of 
0.1 sec. When the value of the luminous flux exceeded the threshold 
of sensitivity, the eye perceived every flash of light. When the 
light intensity was decreased, some of the flashes were no longer 
perceived by the observer. The lower the light intensity, the greater 
the number of flashes that were not perceived. Thus, fluctuations 
in the number of photons in the luminous flux were directly ob- 
served. It is difficult to provide more direct evidence of the corpus- 
cular nature of light. 

Other experiments performed by Vavilov clearly show that such 
typically wave phenomena as interference cannot be explained by 
the photon hypothesis. Using a Fresnel biprism, Vavilov divided 
a beam of light into two coherent components. These components 
yielded an interference pattern. At.the same time, fluctuations in 
both beams of light were completely independent. This circum- 
stance again shows that it is quite impossible to explain interference 
as some Statistical distribution of photons. 

Wave properties are inherent in every photon rather than in 
a stream of photons. Thus, a photon can in no way be viewed as an 
“ordinary” particle. 

At this point, we must digress somewhat. In creating a model 
of the invisible world, we endow elementary particles with properties 
borrowed from the world of things (materials) around us, or, as they 
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say in physics, from the macroworld. Thus, for example, atoms are 
conceived as spherules. Needless to say, an atom spherule only par- 
tially reflects the properties of a material spherule. Everyone knows, 
for example, that such properties inherent in a material spherule 
as colour, roughness and odour cannot be transferred to an atom sphe- 
rule. The more we penetrate into the microworld, the more difficult 
it is to endow elementary particles with material properties. 
Components of an atom or atomic nucleus and particles of light 
resemble a material spherule even less than an atom does. In the 
case of a photon, we saw that it is possible to combine in a micro- 
particle conflicting properties of the macroworld. Of course, in the 
macroworld, a particle is a particle and a wave is a wave. A particle 
occupies a limited region of space and travels along a definite path. 
A wave is distributed continuously in space and the energy is trans- 
ferred to one or another region from all points in space. For mate- 
rials, these two views are irreconcilable. But we have no right to 
impose the behaviour of materials on particles of the micro- 
world. : 
Cognition of the microworld does not consist in the creation 
of a model resembling the pictures familiar to the human eye. The 
infinite process of cognition consists in the investigation of the 
regularities of phenomena, the determination of objectively existing 
causal relationships between phenomena. In this manner, a complex 
picture of the microworld, whose essence cannot be transmitted by 
any ingenious model borrowed from the macroworld, is obtained. 


164. Kirchhoff’s Law 


It has been experimentally established that two bodies having 
different temperatures tend to equalise their temperatures even 
when the bodies are in a vacuum. The energy exchange occurs by 
means of electromagnetic waves radiated by the atoms of these 
bodies. 

As was indicated above, a specific system of energy levels is 
associated with every atom. When an atom absorbs energy, its 
energy level rises; when it radiates, its energy level decreases. Dur- 
ing every radiation process, an atom releases into space an electro- 
magnetic energy hvmn = 6m — Gn, Where 6, is the energy level 
before radiation and @, the level after radiation. The radiated wave 
has a frequency V,. This wave arrives at the other body and is 
absorbed by it. In this case, the energy level of the atom absorbing 
energy is raised from @, tO 6m. 

The same thing may be expressed in terms of photons. Thus, it 
may be stated that during every radiation process a photon of elec- 
tromagnetic energy hv is released; during absorption the photon is 
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captured by an atom and its energy serves to raise the energy level 
of the atom. 

All the atoms of the bodies participate in the energy exchange— 
sometimes a photon is absorbed, sometimes a photon is radiated. 
Depending on the random circumstances, the most varied energy 
transitions may occur and, in principle, electromagnetic waves of 
any wavelength may participate in the energy exchange. : 

Let us assume that the bodies participating in the heat exchange 
form a closed system, i.e., the system of bodies under observation 
is surrounded by an envelope that prevents radiation from passing 
through. Then, after a certain interval of time, these bodies reach 
a state of equilibrium and assume the same temperature. This does 
not mean that electromagnetic radiation ceases. As before, a tran- 
sition will sometimes occur to a higher energy state of an atom and 
sometimes to a lower. But if the equilibrium state has been reached, 
then for each body, at each instant of time, equal quantities of 
energy will arrive and leave. This is true for radiation of any wave- 
length. In general, the radiation arriving at a body is only partial- 
ly absorbed, raising the energy levels of its atoms. The other part 
of the incident radiation is scattered, i.e., reflected, by the body. 

Atoms do not maintain their high energy levels long: in returning 
to their original state, they give up the absorbed energy in the form 
of radiation. If the energy incident on a unit area in 1 sec is designat- 
ed by p, the absorbed energy may be expressed as Ap. The dimen- 
sionless coefficient A, indicating the fraction of energy that is ab- 
sorbed, is known as the absorptivity of the body. Evidently, if 

Ap=é, 
where € is the energy radiated from 1 cm? of surface in 1 sec, the 
body is in equilibrium with its surroundings and its tem perature 
does not change. 

But what is the condition for thermal equilibrium of many bodies, 
which may have, of course, different absorptivities and different 
radiations? On the basis of thermodynamical considerations, Kirch- 
hoff showed that equilibrium is possible only if the intensity of the 
electromagnetic waves incident on a body is the same for all portions 
of the bodies in equilibrium with one another. Thus, 


Frat =... =. 


This relationship is known as Kirchhoff’s law and is valid for any 
wavelength and any temperature. It states that the ratio of the emis- 
sive power of a body to its absorptivity is a constant for any wave- 
length and temperature. 

This means that a body that is a good absorber of certain rays 
is also a good radiator of these rays, and vice versa. Why does the 
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temperature of water in a bottle coated with silver rise slowly, and 
the temperature of water in a dark flask rise rapidly, under the action 
of solar rays? In the first case there is little absorption of solar ener- 
gy, while in the second there is considerable absorption. Now, let us 
assume both vessels are filled with hot water and placed in a refrig- 
erator. The water in the dark flask cools much more rapidly since 
the better absorber is also a better radiator. 

A striking experiment may be performed with coloured ceramic. 
If the colour of a body is, for example, green, it will not absorb green 
light. Thus, if we heat a green crock, it is seen that it begins to 
assume a colour complementary to green. 

It should not disturb us that we have applied a law established 
for equilibrium to phenomena involving bodies clearly not in equi- 
librium (the body is at a higher temperature than its surroundings). 
The situation here is exactly the same as in the case of other thermo- 
dynamic problems (cf. p. 164): the laws of thermodynamics are appli- 
cable if every instantaneous state may be viewed as an equilibrium 
state. In thermal radiation phenomena, this condition is always 
satisfied. 


165. Black-Body Radiation 


Kirchhoff’s law has an interesting consequence. Bodies exchanging 
heat by means of radiation receive (for given values of v and 7) 
the same electromagnetic wave intensity from their neighbours, inde- 
pendent of the material and properties of the bodies. For every 
wavelength (or, what amounts to the same, every frequency) and for 
every temperature, experiments yield a universal value for p. Thus, 
there exists a universal function p (v, 7), i.e., a function of the 
radiation frequency and temperature, characterising the process of 
thermal exchange by radiation. 

The meaning of the function p (v, 7) is easily explained. Consider 
a body absorbing 100 per cent of the energy incident on it for alt 
wavelengths. For such a perfectly black bedy, A = 1 and 


G= (Vv, TL). 


The function p (v, T) is the emissive power of a perfectly black body. 
But what kind of body absorbs light of all wavelengths? Of course 

substances such as lampblack (soot) are almost perfectly black. 
However, all such substances fall short by several per cent of the con- 
dition A = 1. A more ingenious solution exists. Imagine a box having 
a’small aperture. If the aperture’s dimensions are made sufficiently 
small, it may be made perfectly black. This property of apertures is 
well known from everyday observations. A deep hole, an open win- 
dow nonilluminated from within the room and a well are examples 
of perfectly black “bodies”. It is clear what happens in these cases: 
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A beam entering a cavity through an aperture is able to emerge only 
after repeated reflections (see Fig. 187). But with each reflection, 
part of the energy is lost. Therefore, in the case of a small aperture 
and a large cavity, the beam is unable to emerge, i.e., it is com- 
pletely absorbed. 

To measure the emissive power 0 (v, T) of a perfectly black body, 
a long tube made of refractory material is placed in an oven and heat- 
ed. Through an aperture in the tube, the nature of the radiation is 
studied by means of a spectro- 
graph. The results of such 
experiments are shown in 
Fig. 188. The radiation inten- 


| 


Radiation intensity 


sily is plotted as afunction of the wavelength for several tempera- 
tures. It is seen that the radiation is concentrated in the relatively 
narrow spectral interval of 1 to 5 p. Only at high temperatures do 
such curves take in portions of the visible spectrum and begin to 
advance in the direction of short waves. Waves whose wavelengths 
are several microns long are called infrared waves. Since for or- 
dinary temperatures Lhey are the main carriers of energy, we call 
them heat waves. 

The higher the temperature, the more distinct the maximum of 
a thermal radiation curve. With increasing temperature, the wave- 
length A,, corresponding to the maximum of the spectrum is displaced 
in the direction of shorter wavelengths. This displacement obeys 
the Wien law, which is easily established experimentally: 


2,886 
Ree OE 


In this formula, the wavelength is expressed in microns and 7 in 
degrees Kelvin. The displacement of radiation in the direction of 
shorter wavelengths may be detected when a metal is heated. As the 
temperature increases, the colour of the heat changes from red to 
yellow. 
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We call the reader’s attention to another feature of the radiation 
curves, namely, it will be noted that all the ordinates increase sharp- 
ly with increasing 7. If 6, is the intensity for a given wavelength, 
the total intensity of the spectrum is expressed by the integral 


fo) 


rs \ Gidh. 
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This integral is simply equal to the area under the radiation curve. 
Exactly how rapidly does R increase with increasing T? Analysis 
of the curves shows that it increases very rapidly, namely, propor- 
tionally to the fourth power of the temperature: 


R=oT' ergs/cm? sec, 


where o = 5.7 x 10-5 (Gaussian units). This is the Stefan-Boltz- 
mann law. 

Both laws are important in the determination of the temperature 
of hot bodies at great distances. In this manner, the temperature of 
the Sun, stars and the ball of fire of an atomic explosion are deter- 
mined. , 

The laws of thermal radiation are basic to the determination of 
the temperature of smelted metal. The operation of optical pyrome- 
ters is based on the selection of the heating for an electric bulb fila- 
ment in such a manner that the luminosity of this filament becomes 
the same as the luminosity of the smelted metal. We make use of 
the following law: if radiations are the same, so are the temperatures. 
As for the temperature of the heated filament, it is directly propor- 
tional to the electric current flowing through the filament. Hence, 
it is not difficult to calibrate an optical pyrometer. Since actual 
bodies are not perfectly black, it is necessary to introduce in each 
case in the Stefan-Boltzmann formula a factor less than unity (the 
absorptivity of the given body). These factors are determined empiri- 
cally and are significant in heat engineering where problems of heat 
exchange by radiation are extremely important. Nevertheless, the 
above laws are valuable since the general behaviour of radiation 
(dependence on temperature and wavelength) is maintained for 
bodies that are not black as well. The theoretical aspects of black- 
body radiation are discussed in the following article. 
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Let us consider a cavity within which absorption and radiation of 
electromagnetic waves occur. It is immaterial whether this cavity 
is in the form of a sphere or a rectangular parallelepiped. The walls 
of the cavity radiate and absorb equal quantities of energy, i-e., 
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the entire system is in equilibrium. Within the cavity there is an 
electromagnetic field which is in equilibrium with the walls: at all 
points the energy density of the field, w = i (#? + H*), is con- 
stant in time. 

This electromagnetic field may be viewed in two different ways. 
From one viewpoint, there are standing electromagnetic waves in 
the cavity, just as there are standing sound waves in a closed room 
with sound sources. From the other viewpoint, in view of the quan- 
tum nature of the field, it may be stated that the space under consid- 
eration is filled with photons, just as a vessel containing gas is 
filled with molecules. 

From the wave viewpoint, the number of frequencies of electro- 
magnetic oscillations occurring in the cavity may be easily deter- 
mined. The reasoning used for sound waves (see p. 136) is completely 
applicable here too. The number of characteristic frequencies of 
electromagnetic oscillations less than v is equal to 


4 _ v3 
ae 


where c is now the velocity of electromagnetic waves and V is the 
volume of the cavity. This formula gives the number of oscillations 
for the case of linearly polarised waves. In the case of thermal radia- 
tion, we are dealing with nonpolarised oscillations, which may be 
always resolved into components along two axes. 


. Here, the number of oscillations is twice as large and is equal to 
Vv : a . a . 
3 %<7V. Differentiating, we obtain the number of oscillations in 


the frequency interval from v to v + dv: 


Ba" VY dv. 


cé 


Now, let us consider the situation from the “other side of the coin”. 
From this viewpoint, the cavity is filled with oscillations of frequen- 


cy v —in other words, with photons of energy ¢ = hv. The expression 
“ Vdv may be viewed as the number of photons in the cavity, 


2 
and Sav" dv as the density of the photon gas. We shall soon be able 
to answer the following important question: What is the electromag- 
netic energy density in the cavity? If photons of all energies were cre- 
ated in equal numbers, it would merely be necessary to multiply 


2 
e by a 


interval dv. However, the particle energies are not distributed uni- 


dv to obtain the energy density for frequencies in the 
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formly. Therefore, the formula being sought has the form 


Sav? EW (e) dv, 


wy dv = 3 


where W (e) is the probability of a photon of energy e being created. 
Thus, the electromagnetic energy density for waves (photons) 
of frequency v is given by the formula 


Wy = 


Sav" EW (e). 


c3 


The energy flux through a unit area, i.e., the Poynting vector K, 
is c times wy (see p. 325). But the energy flux p radiated from a unit 
area of a body in equilibrium with the field is one-fourth of the value 


of the Poynting vector: p=TK . Thus, between w and 9p, the fol- 
lowing relationship exists: p=tw. What is the origin of the coef- 


ficient =? Since, on the whole, this is not a very important matter, 


£ 
the following simplified explanation should suffice. 
Every unit area radiates an energy flux po in all directions within 
the limits of a hemisphere, i.e., a solid angle 2x. Thus, the average 


radiation within a unit solid angle is equal to fe . From geometry 


considerations, it is clear that the radiation is equal to zero in the 
plane of a unit area and a maximum along its normal. If the decrease 
in radiation intensity were uniform, to obtain the average value 


zit would be necessary for the radiation along the normal to be 


equal to a é 


Now, consider a sphere filled with radiation. At the centre of the 
sphere there is a unit area through which there is an energy flux K. 
On the other hand, however, the radiation falling on this area from 


all parts of the sphere is equal to t. x 4. Hence, p= 5K : 


Thus, using the formula for the volume density of electromagnetic 
radiation, we obtain an expression for the emissive power of a black 


body by multiplying w, by =: 


py = eW (e). 


Further investigation of this function involves evaluation of 
W (e), the energy distribution probability. Historically, the first 
formula for p, was proposed in 1911 by Rayleigh and Jeans independ- 
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ently of each other. It has the following form: 


2nkT 4 
y= =o" v~. 
c 


This formula was obtained by assuming uniform distribution of ener- 
gy per degree of freedom, i.e., W independent of ¢. It is valid for 
long wavelengths and high temperatures. 


Another possibility for W (e) is to use the Boltzmann law, which 


& 
was so successful in the case of molecular gases. Then, W (e) =e "7. 


However, as may be seen from Fig. 189, both the Wien emissive pow- 
er formula, 


hv 

Qnv2 Pepe Ae 

2 7 
Pv=—3 hve kT , 


and the Rayleigh-Jeans formula do not agree with experimental 
results. 

Where is the fallacy in reasoning in these cases? It must be sought 
in the inapplicability of the statistical reasoning lying at the basis 
of Boltzmann’s law to an aggregate of photons. As we have already 
emphasised, photons give us a one-sided picture of an electromagnet- 
ic field. The reality of the field cannot be completely represented by 
a collection of particles. It is, therefore, natural that photons 
should have their “own statistics” (Bose-Einstein statistics). 

To obtain the new function of particle energy distribution replac- 
ing the Boltzmann law, all we need do is take proper account of the 
fact that the wave nature of the field makes the concept of a differ- 
ence between identical particles meaningless. Bose-Einstein sta- 
tistics is founded on this new basis (see p. 684). It leads to the fol- 
lowing law of photon energy distribution: 

Wij ae 


v 
RT oy 


Therefore, the formula for the emissive power of a perfectly black 
body has the following form: 


_ anv hv 
vo 2 hv 
etf_y 


This formula was first obtained by Planck and is named after him. 
The excellent agreement between this theoretical formula and exper- 
imental results, and the nature of the deviations of the Wien and 
Rayleigh-Jeans formulas, are illustrated in Fig. 189. 
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The Wien and Stefan-Boltzmann laws considered above follow 
from Planck's formula. To prove the first of these laws, it is necessary 


to solve the problem for an extremum, i.e., find the root of the 
equation 


Oy 
ov 0. 


To prove the second law, it is necessary to find 


; py dv. 


We leave these calculations to the reader. 


PART THREE 
STRUCTURE AND PROPERTIES OF MATTER 


CHAPTER XXVI 


STREAMS OF CHARGED PARTICLES 


The simplest form of matter is an aggregate of charged elementary 
particles—electrons and ions. Systems of charged particles are en- 
countered in the form of beams of particles in which all the particles 
have a common velocity and move in a single direction and in the 
form of a gas in which the particles move randomly. Intermediate 
states are, of course, also possible. In this chapter, we shall consider 
the basic physical phenomena of such systems and describe equipment 
in which beams and gases of charged particles are used. Problems 
of electron emission, directly related to the physics of solid bodies, 
will be discussed in Chapter XXXVII. 


167. Motion of Charged Particles in Electric and 
Magnetic Fields 


A force f = eB + ~ [vB] is exerted on a charged particle in an 
electromagnetic field (see p. 267). If the fields HZ and B are given as 
functions of coordinates and time, and if the initial velocity and lo- 
cation of the particle are known, then for a particle moving with 
a velocity v < c the particle trajectory 7° (t) may be determined from 
the fundamental law of mechanics: 


d2> 
na = f. 


It is usually mathematically difficult to obtain an exact solution to 
this problem. An idea of the general nature of motion in a field may 
be obtained from an examination of the motion of a charge in a uni- 
form field. 

A Particle in an Electric Field. Assume that a particle enters 
a field at an angle 90° + a (see Fig. 190). For the choice of coordi- 
nates shown in the figure, the equations of motion take the form: 


de 
: =0. 


e 
oa Sk and d 
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Whence, 


e 
vy = — 7 Bt + voy and Vy = Vox. 
Integrating again, and assuming 2 = 0 for ¢ = 0, we obtain 
1 2 
y= — FH Eb + voy and r= Upxt. 


Eliminating, we obtain an equation of a parabolic curve which 
describes the motion of the electric charge (dotted in Fig. 190). 

If the particle enters the field al. 
right angles (vo, = 0), ils path is 
described by the equation 


If the particle enters the field along 
a line of force, it will continue to move 
along the line of force with an accel- 


7 é€ 
eration —£. 
m 


Fig. 190 Designating the potential difference 

between the inilial and final posi- 

tions of the charged particle by V and using the kinetic energy 
equation, we obtain 


eVi= > (v2 — v3). 


If the final velocity v » vo, then 


72 
v= and v= 2 ea 


m 


This equation helps to make clear why the unit of energy known 
as the electron-volt is widely used: 


fev = 1.63 x 10°! org, 


An electron-volt is the work done in moving an electron through 
a potential difference of 1 volt. This unit may be conveniently em- 
ployed when the energy refers to a single elementary particle. The 
work of ionisation and the dislodging and escaping of an electron 
from a metal range from several to several score of electron-volls. 

A Particle in a Magnetic Field. The properties of the force acting 
on a charged particle in a magnetic field are well known (see p. 268). 

Assume a particle enters the field with an initial velocity vo. Let 
us resolve this veclor into the components Ui and vy, which are 
parallel and perpendicular to the field, respectively. Then, for motion 
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in the plane perpendicular to the field, we obtain 
e 
ma = —v,B. 


In the longitudinal direction, the particle will move uniformly with 
a constant velocity v;. 


ee ve 
Motion in the perpendicular plane is circular, and a = +z is the 
centripetal acceleration. Thus, 
= me 
—v,B= Ro 
muye, ; ; . 
Hence, R=—s, ie. the radius of curvature is directly propor- 


tional to the particle velocity and inversely proportional to the 
magnetic induction. It should be noted that all particles of a given 
kind in a given field will 
have the same angular 
; 7 _ eB 
velocity, i.e., © = 77. 
Irrespective of the mag- 
nitudes and directions of 
the velocities, the parti- 
cles will have the same 
frequency of revolution 
about a flux line. 

If a particle enters the field at an incline to the direction of the 
field, it will move with a frequency o in aspiral of radius R (Fig. 191). 
Knowing v,, the projection of the velocity on the direction of the 
flux lines, we may determine the pitch of the spiral: 


ait 2nme 
2S v1 =v, x = eB Vi. 
Il is significant that the quantity vy = vo cos a, where @ is the angle 


formed between the initial velocity vector and the direction of the 
field, is constant to a high degree of accuracy even when the angular 
spread of the initial velocities is 5-10° (v, will vary by no more than 
1 per cent in such a case). Therefore, every z centimetres such 
a divergent beam of charged particles will converge in a point, i.e., 
focus (within the indicated limits) on a generating line of the cyl- 
inder on which the spiral trajectory may be viewed as winding. This 
pe aane line passes through the point at which the particle enters 
the field. 


Example. Assume that an electron, after being accelerated by a voltage 
V = 300 volts, enters a magnetic field of flux density B = 500 gauss at an 
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The secondary ionisers in the gas are electrons, not ions. The latter 
are able to ionise gas molecules only when the velocities are very 
high, i.e., greater than those usually prevailing. If the ions do not 
produce ionisation, the removal of the external ioniser stops the 
discharge even when the number of ion pairs due to impacts is hun- 
dreds or thousands of times larger than in the primary ionisation. 
Every avalanche is initiated by a primary electron, and since the 
electrons migrate toward the anode, the discharge ceases in the 
absence of an external ioniser as soon as all the electrons arrive at 
the anode. 

Such highly dependent discharges possess the following property: 
for a given voltage, the strength of the electric current passing 
through the gas is proportional to the number of primary ions creat- 
ed per unit time by the external ioniser. The ratio of the gas-ampli- 
fied current to the saturation current created by the primary ioni- 
sation may reach a value of several thousand. This property of 
a discharge is utilised in ionisation measuring devices—proportional 
amplifiers (see p. 532). 

An electric discharge may become self-perpetuating, i.e., continue 
after the external ioniser is removed, only if the ions also become 
creators of charged particles. This always occurs when the voltage 
is very high, i.e.—as indicated above—when the ions can ionise 
upon impact with gas molecules. In such a case, the ions will 
create more and more new electrons—the primary sources of ava- 
lanches. en! alien 

However, a self-perpetuating discharge may also occu? at, consid- 
erably lower voltages if the cathode of the gas-discharge tube takes 
the form of a plate. This is because ions are capable of dislodging 
electrons from a cold cathode. If the ion velocity is sufficient for 
such a process, the condition for a self-perpetuating discharge may 
be formulated as follows: new electrons appearing at the cathode 
must, at the very least, replace the work of the primary ioniser. 

Thus far we have said nothing about the role of pressure. At high 
pressures, the discharge column is compressed and thermal ionisa- 
tion begins. When the pressure changes, the current density distri- 
bution changes and so does the luminous nature of the gas discharge. 
At normal and higher pressures, various kinds of discharges are 
encountered, e.g., silent, arc and spark discharges. In the rare gases, 
glow discharges occur. 

A discharge is said to be silent when a leakage of charge from 
a condenser or other charged body is unaccompanied by sound or 
luminosity. Self-perpstuating silent discharges—brush discharge and 
corona—may occur on spikes, thin conductors and, in general, 
wherever sharp drops in potential, and, therefore, large field inten- 

sities, exist. 
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At higher voltages, spark discharges occur, i.e., the gas breaks 
down. The breakdown voltage depends almost exclusively on the 
gas pressure in Lhe region between the electrodes. At normal pres- 
sure, the air between spherical electrodes breaks down when the field 
intensity is 30 kv/em. Measurement of the breakdown distance may 
serve as a measure of high voltages. 

An electric arc is a special type of discharge. The current density 
in such a discharge is large even though the voltage between the 
electrodes is low. The distinctive 
feature of an are discharge, usual- 
ly created between carbon elec- 
trodes, is Lhe extraordinary high 
temperature allained by the elec- 
trodes. Therefore, thermionic 
emission from the cathode plays 
a large role in an arc. 

In the rare gases, a glow dis- 
charge has a characteristic form 
for every pressure. The degree of 
rarefication may be determined 
experimentally with great accu- 
racy by mere observation of the 
discharge form. Various types of 
gas discharge forms are shown 
in Fig. 193. 
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Ina gas-discharge tube, an elec- 
tron stream moves in the opposile 
direction to a stream of positive 
ions. To obtain an ion ray, i.e., 
a beam of ions moving in one 
direction, a hole or canal is made in the cathode. A large propor- 
tion of the ions entering this aperture passes through it and then 
continues to move by inertia. Such beams, called canal or positive 
rays, were known to physicists as far back as the last century. 
A similar method of obtaining an ion stream is used even today. 
First, a substance is transformed into the gaseous state. Then, its 
molecules are ionised and the posilive ions removed from the gas- 
discharge region through a cathode canal. 

A gas discharge is not used to create an electron beam. A so-called 
electron gun serves as an electron beam source. This is a device based 
on the phenomenon of thermionic emission (see p. 695). Heated 


29* 
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Fig. 193 
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metals, as is well known, may serve as electron sources. Thus, 1 cm? 
of tungsten surface heated to 2,400° yields in one second a number 
of electrons corresponding to a current strength of 1 ampere. 

Fig. 194 is a diagrammatic representation of an electron gun. 
To accelerate the electrons, a vollage is applied across the electrodes. 
A tungsten filament (Z) heated by an electric current serves as the 
cathode. The anode (2) has the shape of a glass with a round hole in 
the bottom. Electrons emerge from this aperture, which determines 
the divergence and width of the beam. 
The focussing electrode (3) makes it pos- 
sible to obtain beams of electrons which 
are fine and intense (see Sec. 171). 

The problem of obtaining an electron 
beam of maximum intensity for a given 
expenditure of energy is of great engineer- 
ing importance. 

To utilise all the electrons emitted by 
the filament, one must, in the first place, 
accelerate the electrons with a sufficiently 
high voltage. The filament emits a certain 
number of electrons per unit time. All 
these electrons must be drawn away from 
the filament. If the voltage is low, an 

Fig. 194 electron cloud which impedes emission 

is formed near the filament. As the volt- 

age is increased, the cloud is gradually dissipated and the’therm- 

ionic current increases. Finally, we reach a voltage with which 

no electron cloud is formed. A further increase in voltage does not 

result in an increase in thermionic current since saturation has heen 

reached. This is the condition required for electron gun operation. 

Thus, a sufficiently high voltage ensures that all of the electrons 
are drawn away from the filament. 

The next problem is to obtain increased electron emission from 
a filament. The emission from thorium and oxide cathodes is many 
times greater than that from tungsten cathodes. A thorium cathode 
consists of a tungsten wire coated with a very thin layer of metallic 
thorium. Thoriated tungsten yields the same current at 1,500° as 
pure tungsten does at 2,400°. An oxide cathode consists of a metal- 
lic base coated with a layer of an oxide of an alkaline earth metal. 
Such a cathode yields the same current at 900° as tungsten does 
at 2,400°. In modern electronic devices, oxide cathodes are 
heated indirectly. The cathode is manufactured in the form of a 
tube in which there is placed a tungsten spiral heated by an electric 
current. 
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171. Electron Lenses 


An electron beam may be controlled by electric and magnetic 
fields. The action of such fields is not restricted to the deflection of 
a beam from its original direction. Thus, a parallel beam of electrons 
may be made to converge or diverge, and a beam diverging at one 
point may be made lo converge at another. A “lens” for an electron 
gun is produced by a very simple system of fields. A very important 
branch of science known as electron optics, the most significant 
achievement of which is the electron microscope, .has developed 
on the basis of this principle. 

Let us recall the properties of an ordinary double convex lens. 
If an object is placed on one side of such a lens, the image of the 


Fig. 195 


object on the other side will be magnified or reduced in size. This is 
because all rays emerging from an object point gather at an image 
point and, moreover, all image points are located in a single plane 
perpendicular to the axis of symmetry of the lens. The simple geo- 
metric construction of Fig. 1495 shows why the lens operates in such 
a manner: the angle of deflection of a ray which impinges on a lens 
is proportional to the distance 2 between the axis of symmetry and 
the point of intersection of the ray and the lens. The construction 
has been made for an object point lying on the axis of symmetry, 
but the results are similar for other points as well. It should be stipu- 
lated (as is done in optics) that the discussion is valid if the lens is 
thin and the beam divergent within the limits of a small solid 
angle. : 

We shall now show that electric and magnetic fields having axial 
symmetry may serve as lenses. Such fields may be obtained by means 
of the following: electrically charged plates with a round aperture 
in one of them, cylindrical condensers, loops of current and flat coils. 
There are a large number of systems which may serve as lenses for 
electron rays. However, an example of an electrostatic lens and an 
example of a magnetostatic lens will suffice to clarify the principle. 


454 Streams of Charged Particles 


Let us consider a condenser in which a round aperture has been 
made in one of the plates (see Fig. 196). If an electron beam impinges 
on this aperture from the side of uniform field, the beam will be 


~ 
ill ae ~ 


' Fig. 196 


focussed: When an electron reaches the region of nonuniform field, 
a force perpendicular to the equipotential surfaces, and therefore 
at an incline to the axis of 
symmetry, acts on it. Resolv- 
ing this force into two com- 
ponents, we see that there is 
a radial component urging the 
electrons toward the axis. But 
this does not suffice for the 
system to act as a lens. It will 
also be necessary for the radial 
Fig. 197 component of the field to be 
proportional to the distance 
between the axis of symmetry and the point at which the electron 
reaches the plane of the aperture. It may be easily shown that 
this is indeed the case. The radial component of the electric field 
intensity may be expressed in the form 


1 dE 
ae rT 
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where < is the field intensity gradient along the axis of symmetry. 


To prove this, consider a small cylinder oriented as shown in 
Fig. 197, where the distance 7-2 is infinitely small. Since there is 
no charge inside the cylinder, ar°dE, the flux difference between 
the ends / and2, must be equal to —£,2nrdz, i.e., the flux through 
the lateral surface with the reverse sign. 

Thus, an aperture in a charged electric plate serves as a lens for 
an electron beam. 

Now, let us consider the behaviour of electron rays passing through 
a flat current-carrying coil (see Fig. 198). Such a coil constitutes 
a magnetostatic lens. The electrons move in a spiral and_ re- 
turn to the axis of symmetry 
after completing one turn ofa 
helix. The focussing properties 
of the coil are evident. It may 
be shown that the deflection 
angle of a ray is proportional 
to the distance of this ray from 
the axis of symmetry. The mag- 
netic coil changes the azimuth 
of the electron trajectory, i.e., 
in such a lens the image of an 
object is turned. But this angu- 
lar displacement does not dis- Fig. 198 
torl the electron-optical image. 

Thus, for an object scattering or radiating electron rays, an “elec- 
tron image” of the object may be obtained if an electrostatic or 
magnetostatic lens is placed in the path of the scattered electrons. 
When a photographic plate or luminous screen is placed in the 
plane of the image, a peculiar “picture” of the object is obtained. 
It is bright at points corresponding to the radiation or scattering 
of many electrons and dark at points corresponding to the absence 
of radiation or scattering in the object. Since a system of electron 
lenses yielding a magnified image of an object can be constructed, 
it is possible to construct an electron microscope. 


172. The Electron Microscope 


The electron microscope, i.e., a microscope in which the role of 
a light ray is played by a beam of electrons, provides exceptional 
opportunities, not yet fully utilised, to “observe” objects directly. 
This is because the possibilities of magnifying an object are, general- 
ly speaking, unlimited in an electron microscope. On the other hand, 
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an optical microscope provides a magnification of not more than 
2,000-3 ,000. 

To understand the reasons for this difference, we must familiarise 
ourselves with the resolving power of a- microscope. The question 
arises: What are the conditions for seeing two close points sepa- 
rately? 

Imagine that an ideal point source of light is located in front of 
a slit or round aperture.,When light passes through the aperture, 
a diffraction pattern is obtained. A lens placed behind the aperture 
does not concentrate the rays in a point. On the contrary, a blurred 
circle (or band, in the case of a slit) surrounded by alternately bright 
and dark rings appears. On p. 367, the magnitude of this blur for 
a slit was calculated. The radius of the disk, to which a point diffract- 
ed from a circular aperture corresponds, was given on p. 368. 


It is equal to 10901 

Every optical instrument must have an aperture of entry—the ' 
objective. Diffraction at the objective is inevitable, and any lumi- 
nous point in the focal plane of the instrument is diffused into 
a luminous circle. The angular dimension of the radial blur is equal 


X A : . ‘ : 
to 1.225. Therefore, its linear dimensions in the focal plane are 


equal to 1.2220 Here, f and D denote the focal distance and the 


diameter of the objective, respectively. In the case of a microscope, 
this formula gives merely the order of magnitude since the object 
is close to the objective and, as a result, the beam of rays cannot be 
considered parallel. But since we are only interested in the qualita- 
tive picture, we shall not go into the fine points. 

If two luminous points observed in a microscope are so close that 
the centres of their. luminous image fields are closer to each other 
than a distance equal to the field radius, these two points cannot 
be distinguished as separate points. 

The limit of linear resolution in a microscope is equal to 1.2241 : 
Since the ratio of the focal distance to the objective diameter cannot 
be made significantly less than unity, a microscope enables us to 
observe two points separated by a distance of the order of a wave- 
length. Thus, when viewing in ordinary light (wavelength of the order 
of 0.5 fu), we cannot detect object details smaller than a hundredth 
of a micron. 

What is the magnitude of useful magnification which may be 
obtained with an optical microscope? Imagine that a picture is 
viewed through an ocular, is photographed, then the latter photo- 
graph is viewed through an ocular, elc. It is evident that in this 
manner any desired magnification can be achieved. However, fur- 
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ther magnification loses all meaning when it is seen with the naked 
eye that the resolution limit of points of a photograph has been 
reached. Thus, if a photograph obtained with an optical microscope 
is magnified so that 0.5-1 mm _ corresponds 
to one micron, the limit of useful magnifi- 
cation has been reached. Hence, the useful 
magnification of such a microscope is about 
1-2 thousand. 

As will be shown in the next chapter, an 
electron ray has the properties of a wave of 
wavelength 

j=. 
mov 
where kA is Planck’s constant, m is the mass 
of an electron and v is its velocity. When 
the voltage is equal to 50,000 volts, the 
wavelength equals 0.05 A. But the distance 
between atoms is greater than 1 A. Hence» 
the usefulness of an electron microscope is 
not limited by its resolving power. 

Caléulations indicate that the resolution 
limit of an electron microscope is 2-3 ne 
At present, it is possible to achieve a re- 
solution of 5-6 A, i.e., a useful magnifica- 
tion of a million. 

It turns out that there is much in com- 
mon between light optics and electron optics. 
In electron-oplical instruments, we find the 
same elements and the same principles of 
construction encountered in ordinary optical 
instruments. The main difference (and this 
is not of a basic nature) is that the “index 
of refraction” of an electron-optical lens var- 
ies continuously, since the electric and 
magnetic fields vary continuously, while 
that of an optical lens varies abruptly 
(at its boundary). Fig. 199 is a diagrammatic representation of 
an electron microscope: 7) eleclron projector, 2) condenser lens, 
3) object, 4) objective, 5) intermediate image, 6) projection lens, 
7) final image, 8) observation window. If we wish to examine an 
image directly, a fluorescent screen may be used instead of a photo- 
graphic plate. An electron microscope is much larger than an optical 
microscope, requires a source of electric voltage and costs consider- 
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ably more. But this is compensated for by its tremendous resolving 
power. 

The electron microscope portrayed in the diagram employs mag- 
netostatic lenses. A high vacuum, of the order of 10-> mm of Hg, 
is created in the system in order to prevent electrons from colliding 
with air molecules. An electron gun produces a beam of electrons 


a) b) 
Fig. 200 


with an energy corresponding to 50,000 volts. Therefore, the instal- 
lation must include a high-voltage transformer to boost the line 
voltage to the indicated value. 

Different methods of observing an object by means of an electron 
beam exist. Since matter is a very strong absorber of electrons, 
its thickness must be no greater than a fraction of a micron if we 
wish to observe an object in the “window”. When electrons pass 
through a thin layer of a substance, they are scattered differently 
by different portions of it. Fig. 200 illustrates the two methods used 
for electron vision. Only those electron rays transmitted through the 
substance without scattering are allowed to pass, while the scattered 
rays are blocked by a diaphragm (Fig. 200a). In such a case, the bright- 
est parts of the image correspond to those portions of the substance 
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which do not scatter electrons, including those portions where the layer 
of substance is particularly thin. On the other hand, the parts of the 
object which scatter electron rays in profusion are dark. The second 
method is the reverse of the first (Fig. 200b). The object is placed at 
an angle to the axis of the microscope, so that only scattered elec- 
trons are directed through 

the lenses. It is. evident 


that the roles of bright and z Ty, 
dark fields in the image are Pig. 
now reversed. Sy 


The examination of objects 
in an electron microscope 
is usually performed on a 
base the thickness of which eat 
is about 0.01 p. Such a base ~ 
is made in the following 
manner. A drop of asolution szmmmts&rtu6 (feb hrererrerery 
of collodion in amyl acetate , 
is placed on the surface of 
water. The drop spreads on 
the surface forming a thin film which becomes quite firm after the 
amyl acetate evaporates. A loop made of thin wire is placed under 
the film. The object holder is now ready. This base will appear bright 
for normal incidence of the beam. It will appear dark when the 
beam impinges at an angle. 
If the objects under investi- 
gation are poor sctatterers of 
electrons, they will not be seen 
very well against the common 
background. The objects are 
sprayed with a metal to obtain 
more contrast. The base with 
the mounted object is placed 
in the path ofa stream of met- 
al atoms produced by vaporis- 
ing a metal in a vacuum. The 
spray is directed at an angle 
Fig. 202 to the base, and the sample 
becomes shaded as shown in 
Fig, 201. When an object is examined with electron rays an exceeding- 
ly bright picture is obtained since electrons are scattered only from 
the parts of the object sprayed with metal atoms. Fig. 202 shows 
how flu viruses look under an electron microscope. 
The examination of objects on a base is particularly important 
in biology and medicine. Bacteria are scooped up with the sample 


Fig. 20! 
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holder from the medium in which their presence is suspected. It is 

easy to study particles obtainable in a suspended stale since they 
can be scooped up with a holder. 

Entirely different methods are used in examining the surface of 

a solid. Under certain circumstances, a solid may be made to emil 

electrons. By passing the resulting 

ue a ger beam of electrons through lenses, we 


are able to see the surface. However. 
% this procedure cannot always be used: 
buy 


Foal 


J when there is low emission, when 
“the sample cannot be heated, etc- 

" ; Under such circumstances, the rep- 
% ir lica method is used. In this method, 
a an object iscoated with a thin layer 

= = of substance, which may be sepa- 
a rated from the object and examined 

in the opening of an electron mi- 
croscope. Experiments show that 


XE of. , such layers consisting of any one of 
ta variety of substances, e.g., organic, 

Fig. 203 metallic and quartz, form exact rep- 
licas-of the surface under investiga- 


; tion. A photograph of the surface of 
frosted glass obtained in this manner is shown in Fig. 203. The replica 
method requires meticulous experimentation. It is no easy task to . 
separate the layer of substance from the object. One of the methods 
used is to dissolve the object without damaging the film. 


173. Electron and Ion Projectors 


By means of an electron microscope, it has become possible to 
perceive large molecules as distinct spots or points. But the means 
are available to achieve considerably more, namely, the shape of 
a molecule may be discerned and a picture of its electron cloud 
obtained. This has been accomplished by means of special micro- 
projectors. , 

Fig. 204 is a diagrammatic representation of an electron and ion 
microprojector. This consists of a vessel evacuated to 10-§ mm of Hg 
and containing electrodes. The cathode has the shape of a spike the 
point of which has a very small radius of curvature. It is possible 
to create near a cathode having this shape a field of the order of 
10° volts/em. For such a field, electrons are torn away from a cold 
cathode in a radial stream. If an obstacle is located in the path of 
the stream, a dark image appears on the fluorescent screen (or pho- 
tographic plate). If an object lies on the surface of the point, the 
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magnification is equal to the ratio of the distance between the point 
and the screen lo the radius of curvature of the point. Using special 
means, the radius of curvature may be made less than 200 A. 

If molecules of a substance are placed on the point, their images 
appear on the screen. This has been done with phthalocyanide mole- 


cules, the dimensions of which are about 15 A. The form of the 


Fluorescent -20kv 


Screent 


Annular electrode 


Palladic tube 

A H,- flame 
0123 4 §¢em 
a a a 


To pump 
Fig. 204 


molecule, its characteristic four-petalled structure, and the concen- 
tration and rarefaction of the electron density were clearly visible 
on the screen. 

Although this method can certainly not be used for all objects 
under normal laboratory conditions, the possibilities of a method 
yielding a useful magnification of more than one million should not 
be underestimated. 

But the resolving power may be increased by yet another order of 
magnitude and, moreover, the clarity of the image may be consider- 
ably improved. This may be accomplished by using an ion beam 
instead of an electron beam, but otherwise employing the same prin- 
ciple of object examination. An ion projector does not differ in prin- 
ciple from an electron projector. The point is given a positive poten- 
tial and when the field is large (10® volts/cm) ions may be torn away. 
For this purpose, it is necessary that atoms or molecules be adsorbed 
by the surface of the point either beforehand or during operation of 
the projector. In the instrument shown in Fig. 204, a small quantity 
of hydrogen molecules is introduced into the vessel by means of 
a palladic tube. As soon as neutral atoms (or molecules) settle on 
the surface of the point they give up an electron and then, as positive 
ions, move toward the screen. 
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By means of such an ion projector, it has been possible to oblain 
the image of a tungsten point itself. An image arises owing the fact 
that adsorption of atoms occurs in specific parts of a tungsten crystal. 
In the obtained image, it was possible to discern the lattice period. 


i.e., the resolution attained equalled 2-3 A. 


174. The Electron-Beam Tube 


An electron-beam tube is a widely used device, being an essential 
component of a television set, radar system and oscilloscope. The 
principle of operation of such a Lube may be explained by means of 
‘the simplified diagram shown in Fig. 205. We see in the figure an 
electron gun (Z) and two condens- 
ers (2) for deflecting the electron 
beam in two mutually perpen- 
dicular directions. 

Let us consider the application 
of an electron-beam tube to the 
recording of rapid processes. If no 
voltage is applied to the deflecting 
plates, the electron beam is direct- 


; ed along the axis of the device 
anda spot appears on the luminescent screen. Assume that an alternat- 


ing voltage having a frequency greater than 20 cps is applied to the 
horizontal deflecting plates. Then, the electron beam begins to oscil- 
late in the vertical direction in synchronism with the varying field. 
Since electrons have very little mass, these oscillations will have 
practically no inertia.* The motion of the beam is not perceived 
because the luminous spot moves too rapidly for the human eye to 
follow; moreover, the screen has an afterglow. 

Now, let us consider the second pair of plates, which provide the 
so-called “sweep”. A saw-tooth vollage is applied across these plates. 
Since this second pair of plates deflects the beam horizontally, the 
luminous spot moves, say, from left to right quite uniformly under 
the action of such a voltage. When the edge of the screen is reached, 
the luminous spot rapidly returns to its initial position and the proc- 
ess begins anew. By changing the frequency of the saw-tooth voltage 
within a broad range of frequencies, we can vary the time scale of 
the horizontal sweep accordingly. 

If a sweep voltage is applied to the horizontal deflecting plates 
and the voltage being investigated is applied to the vertical deflect- 


Fig. 205 


* This absence of inertia is determined by the axial velocity of the electrons. 
Therefore, to record very rapid processes, high voltage oscilloscopes are used- 
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ing plates, a curve of voltage vs. time will be obtained on the screen 
since the horizontal coordinate of the luminous spot is proportional 
to the time reckoned from an arbitrary instant. 

An oscilloscope is particularly useful in the investigation of period- 
ic processes. It is always possible to select the sweep in such a man- 
ner that the curve described by the beam during one run from left 
to right coincides with the curve described during the second and 
successive runs. When the sweep period is fixed, we obtain a stationa- 
ry curve of voltage as a function of time in a given time interval 
(from a fraction of a period to several periods). 

A saw-toothed voltage is produced by a self-sustained oscillatory 
process analogous to that described on p. 100 (the toppling of a tub of 
water). The movement of the beam from left to right is produced by 
continuously and uniformly charging a condenser.* A discharge tube 
is connected to the terminals of the condenser. As long as the poten- 
tial difference across the tube is less than the ignition potential, the 
presence of the tube does not affect the charge on the condenser. 
When the potential reaches a critical value,-the condenser rapidly 
discharges and the process begins anew. The saw-toothed oscillations 
must be synchronised with the periodic process under investigation. 

An electron-beam tube becomes more complex when a modula- 
tor is placed between the cathode and the anode. Such a modulator 
consists of a metallic cylinder, one end of which is covered with 
a diaphragm containing an aperture equal to the size of the cathode. 
A negative potential applied to the modulator makes it possible 
to control the intensity of the beam. At a certain value of voltage 
(the blanking voltage), the beam is completely cut off. Such blanking 
is necessary, for example, during the return trace’of the beam. Thus, 
by means of the modulator, the trace of the beam is blanked during 
the return sweep. 

Two variable quantities may be viewed simultaneously on a screen 
if an electron-beam tube is equipped with an electron switch that 
alternately connects the deflection mechanism in one measuring 
circuit and then in another. Double-beam oscilloscopes have been 
developed for this same purpose. Such an instrument is equipped 
with an electron-beam tube having two independent electron pro- 
jectors and deflecting systems. A double-beam oscilloscope has in 
addition two separate amplifiers for the voltages being investigated 
and two saw-toothed voltage generators. 

The choice of a proper luminescent screen for an electron-beam 
tube is of great importance. For certain purposes, long-persistent 
screens are desirable, while for others it is required that the lumi- 
nosity disappear as soon as the beam is switched off. 


* A condenser charges and discharges exponentially, but by using a small 
portion of the exponential curve these processes may be made quite linear. 
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Single-pulse processes may be recorded with an electronic oscil- 
loscope if it is equipped with an auxiliary camera the shutter of 
which is synchronised with the sweep. This makes it possible to 
photograph the screen at the required instant. 


, 175. Mass Spectrograph 


The fundamental equation of motion of a charged particle, 
2p 1 
mga = (E+—[vB)) , 
shows that the path of a charged particle is determined by —, the 
m 


ratio of the charge of the particle to its mass. Therefore, measure- 
ments of the deflection of a charged particle in an electric and amag- 


netic field may be used to find =. Since the initial velocity of a par- 


: . : e 
ticle is not known, = cannot be determined by measuring the dellec- 
tion in either an electric or magnetic field alone. The general formu- 


8- direction away from us 


Stream of 
positive 
charge 


Fig. 206 


las for deflection in electric and magnetic fields (Sec. 167) show that 
the path is determined by coefficients containing - and the initial 


velocity. The problem is solved by measuring the deflection of one 
and the same particle in an electric and a magnetic field. 

In the simplest case, it is sufficient to balance the electric and mag- 
netic deflections. For this purpose, the fields should be oriented as 
shown in Fig. 206. Charged particles will not be deflected when the 
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following condition is satisfied: e& = tei. This experiment 


enables us to determine the velocity of a particle. Now, it is merely 
necessary to measure the deflection produced by the electric field 
or by the magnetic field alone. Knowing the magnitude of the deflec- 


tion of a particle from its rectilinear path, we may calculate ~. 


Measurements of — are of great importance in atomic physics 


as a means of determining the mass of a particle when the charge is 
known. This pertains particularly to the determination of the mass 
of ions. 

An instrument in which the particles of a beam may be sorted 
according to mass, and the composition of the beam according to 


Fig. 207 


mass may be investigated, is called a mass spectrograph. The mass 
spectrograph proposed by Aston is represented schematically in 
Fig. 207. Its principle of operation may be explained as follows. 
Particles of various velocities are introduced into the electric field 
of a condenser. Consider a group of such particles having the same 


< ratio. Upon entering the electric field, a stream of these particles will 


be divided since fast particles are deflected less than slow ones in an 
electric field. Now, this spread of particles is introduced into a magnet- 
ic field (perpendicular to the page). The sense of the field is such that 
the direction of deflection of the particles is opposite to that in the 
electric field. Here, too, fast particles are deflected less than slow 
ones. Hence, it follows that at some point beyond the field the divid- 
ed beam of particles will again gather at a point, i.e., become 
focussed. 


Particles having a different < ratio will also gather at a point, 
but not at the same one. Calculations indicate that the foci of all 
-. lie approximately on a straight line. If a photographic plate is 


placed along this line, each group of particles will be represented by 
a separate line. 


30-1409 
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If a mass spectrograph is constructed with great accuracy, ils 
resolving power will be extremely high and it can be employed to 
detect the presence of very close isotopes. At first glance, such pre- 
cision may appear unessential since it may be reasoned that the 
masses of isotopes differ by at least one atomic weight unit. But 
while it is true that isotopes of one and the same chemical element 
differ by an atomic weight unit, isotopes of different elements (e.g., 
S*° and A®*) may differ very little in mass. Moreover, it is important 
to be able to determine the mass of complex ions. Such problems 
arise, for example, in connection with the chemical analysis of gas 
mixtures. Different particles may then turn out to be close in mass, 
e.g., CH, and N™ or NH, and O%. All such problems may be 
solved by means of a mass spectrograph. 


176. Accelerators of Charged Particles 


Actually, all such devices as electron tubes, X-ray tubes and elec- 
tron guns are accelerators of charged particles, but this term general- 
ly denotes installations producing streams of charged particles 
(electrons, protons, deuterons, etc.) moving with velocities close 
to the velocity of light. Such streams of particles are then allowed to 
impinge on matter. The interaction achieved may be used for various 
purposes: investigation of nuclear transformations, production of 
radioactive isotopes, medical purposes, chemical action, etc. The 
role of accelerators in modern science is an extremely important one. 

It is possible, of course, to accelerate a particle to any desired 
energy by making it pass through successive accelerating fields. 
However, such a brute force approach, i.e., the construction of a 
linear accelerator, is not always practical. To accelerate particles 
to energies of only tens of thousands of electron-volts requires a path 
length of the order of many centimetres. But modern physics is 
striving to obtain streams of particles with energies of tens of billions 
of electron-volts. Such a linear accelerator would have a length of 
tens of kilometres. Practically, therefore, such a solution to the 
problem is unacceptable. ' 

Lawrence was the founder of high-energy accelerators. His basic 
idea is that in a single installation particles should be accelerated 
by an electric field and repeatedly made to return to the same accel- 
erating gap by means of a magnetic field. The first accelerators 
operating on this principle became known as cyclotrons. 

A cyclotron is represented diagrammatically in Fig. 208. The 
accelerating chamber may be pictured as a flat circular pill box cut 
along a diameter. It is of large dimensions, made of metal, and its 
two halves—the basis of the accelerating chamber—are known as 

Dees. An alternating electric field of period 7 is applied to the Dees. 
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This entire system is placed between the poles of an electromagnet, 
which creates a strong constant magnetic field inside the box per- 
pendicular to its base. 

The magnetic field intensity is determined by the period of the 
electric voltage. That is, the value of the field intensity must be such 
that the period of revolution 


in this field (expressed by the JDeflecting 


formula JT | is equal to 
the period of the electric field. — Ardits 
When this condition is satisfied, ; 
charged particles entering the 
accelerating chamber are cap- 
‘tured by the fields and accel- 
erated spirally with constant 
period 7. Thus, a particle in 
the gap between the Dees is 
accelerated, traverses half a 
circle in the magnetic field and 
arrives at the accelerating gap 
just when the voltage phase Fig. 208 
changes by 180°. Thereupon, 
the particle is accelerated in the same direction and enters the 
other Dee where it traverses half a circle of largerradius. Each time 
the particle passes through the gap its velocity increases. The particle 
must be deflected from its circular path to eject it from the cyclotron. 
A cyclotron has limited usefulness. As the velocity of a particle 
increases its mass increases and, hence, its period of revolution in 
the magnetic field also increases. As a result, the particle begins to 
lag, i.e., it arrives at the accelerating gap when the phase of the 
voltage has changed by more than 180°. This lag increases until, 
finally, the electric field not only does not accelerate particles, but 
retards them. Calculations indicate that the maximum energy that 
a cyclotron can transmit to a charged particle is given by the formula 
D) / evgme? 
4 


Vacuum chamber 


Radio- frequency , 
generator 


. For protons this amounts to 22 million electron-volts 


(22. Mev), and for a-particles about three times more. New means 
are necessary to attain higher energies. 
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Veksler in the Soviet Union and McMillan in the United States 
elaborated a new concept in the accelerator field in 1944 and 1945, 
respectively. This concept may be summarised as follows: Examina- 


30* 
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tion of the formula for the period of revolution of a particle shows 
that increased mass may be compensated for by an increase in mag- 
netic field. If such compensation takes place, the period of revolution 
of.a charge will remain constant. 

Let us assume that the magnetic field intensity increases during 
the operation of the cyclotron. Among the thousands of millions of 
particles moving in the accelerating chamber, there are undoubtedly 
some particles whose increase in mass due to increased velocity is 
exactly proportional to the increase in magnetic field. Hence, the 
period of revolution of such particles will remain unchanged. Calcu- 
lations indicate that under such conditions other particles will also 
not fall out of synchronism. The only difference, which is quite un- 
important in practice, is that the energy of these particles will not 
increase in a monotone manner, like the energy of the “lucky” par- 
ticles whose increase in mass is completely compensated for by the 
increase in magnetic field, but will fluctuate about the energy value 
of the “lucky” particles. 

The orbit radius of the “lucky” particles agrees with their energy. 
This is the reason for their “good fortune”. Now, let us consider a 
particle the energy of which is greater than that required for a given 
radius. Such a particle will be retarded owing to an excess of mass 
increment. On the other hand, if a particle has less energy than 
required for a given radius, the mass increment will be insufficient 
and the particle will be accelerated relative to other particles at 
the same radius. Thus, since the mass of a particle increases with 
velocity, particles can, so to speak, regulate their own velocity and 
select voltage phases which serve to correct their motion. That is 
why this phenomenon is referred to as “auto-phasing”. 

It transpires, therefore, that it is possible, in principle, to increase 
the velocity of particles in a cyclotron without limit if the magnetic 
field intensity is gradually increased. In such an installation par- 
ticles must be accelerated in pulses. When the field increases parti- 
cles are accelerated, but the reverse cycle is idle. 

The above method is not the only way to achieve auto-phasing. 
Another approach is to slowly vary the period of the electric voltage. 
The principle is basically the same: an increase in the mass of a 
charged particle results in an increase in the period of revolution in 
the magnetic field; in this case, the regime of the alternating electric 

field is varied so as to compensate for this increase. An accelerator 
in which the period of the electric voltage is slowly increased jis 
known as a synchrocyclotron. The path of a particle in a synchro- 
cyclotron is a flat spiral. The farther this spiral extends, the greater 
the energy of the particle. Thus, an increase in energy is associated 
with an increase in the area of the accelerating chamber in the mag- 
netic field. The most powerful accelerator of this type is the synchro- 
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cyclotron of the U.S.S.R. Academy of Sciences. This accelerator 
yields a beam of 680 Mev protons. Its magnet weighs 7,000 tons. 
- The energy limit of a synchrocyclotron is of the order of hundreds 
of Mev since a further increase in energy would result in an unthink- 
able increase in magnet weight. Particle energies of thousands of 
millions of electron-volls (Bev) are atlained by means of proton 
synchrotrons. , 
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Proton synchrotrons are basically different from cyclotrons. Since 
a proton synchrotron accelerates particles in a single circular orbit, 
the volume of the magnetic field may be greatly reduced. The entire 
central region of the magnet is, so lo speak, cut out. As a result 
much less steel is required for the magnet. Thus, the electromagnet 
of the 10-Bev proton synchrotron of the U.S.S.R. Academy of Sci- 
ences weighs 36,000 tons, but the electromagnet of a 10-Bev synchro- 
cyclotron would weigh about 1 million tons. 

To accelerate particles in an orbit of constant radius, one must 
vary the period of the accelerating electric field and the intensity of 
the magnetic field in a very definite manner. In such an installation, 
particles are accelerated in pulses. Each pulse is obtained by increas- 
ing the magnetic field and decreasing the period of the accelerating 
electric voltage in a prescribed manner. 

For a given orbit radius, a unique relationship exists between the 


field intensity and the velocity: 


Sf a eh 
V'-5 : 
and the relationship between the period of revolution-and the veloci- 
ty is determined by the expression 
2nur 


r= * 


v 


If these conditions are satisfied, a “lucky” particle will be accelerated 
in a monotone manner. 

Since the auto-phasing conditions occur here too, the remaining 
particles follow a path which oscillates about the circular orbit and 
they also take part in the synchronous increase in velocity. Since 
particles oscillate about an average circular orbit, it is necessary 
to make the width of the pathway for the charged particles rather 
broad. Methods are being sought which would allow us to reduce the’ 
width of this pathway. Success would lead to a reduction in the 
amount of steel required for a given accelerator and make it practical 
to construct accelerators with even higher particle energies. 
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Thus far we have been discussing accelerators of heavy particles. 
Electron accelerators have a number of special features. 

As far back as 1941, an accelerator known as a betatron was con- 
structed to accelerate electrons. Such an installation operates on the 
principle of a transformer. The winding of a magnet constitutes the 
primary winding and a beam of electrons revolving in a circle of 
constant radius constitutes the secondary “winding”. In other words, 
the electrons move along a circular line of force of the rotational 
electric field produced by an alternating magnetic flux. 

At first glance, it appears that such acceleration may continue 
without limit. The increase in mass with velocity does not limit the 
acceleration since there is no need for synchronism in this phenome- 
non. Nevertheless, a betatron does have a limit. At energies of sever- 
al hundred Mev, energy losses due to radiation become considera- 
ble—an accelerated electron radiates electromagnetic waves. As a result 
of this radiation, the electron path is transformed from a circle to an 
inward spiral and acceleration becomes impossible. Betatrons are 
useful when electrons with energies from 20 to 50 Mev are required. 
If greater energies are desired, electron synchrotrons must be used. 
Such accelerators were first proposed in 1946-47. 

The electron] synchrotron is similar to the synchrocyclotron de- 
scribed above, i.e., it isa resonance accelerator. An accelerating elec- 
tric field is added to the magnetic field of the betatron. The accel- 
erating mechanism is maintained by auto-phasing. But the fact that 
we are dealing with lighter particles, namely, electrons, simplifies 
the construction problem. This is because at energies of only 2 to 
3 Mev the electron velocity is almost equal to the velocity of light. 
Hence, when the energy increases further, the radius of the electron 
path does not change. This makes it possible to construct the magnet 
in the form of a ring as in the case of the proton synchrotron. The 
radiation losses of the electron synchrotron are compensated for by 
increasing the accelerating voltage. 

At- high-energy levels, radiation losses reach imposing values. 
In a 300-Mev accelerator having an orbit radius of 1 metre, an elec- 
tron radiates 1,000 ev per revolution. In a 10-Bev electron synchro- 
tron having an orbit radius of 20 metres, the energy losses per revolu- 
tion would be equal to 30 Mev. 


CHAPTER XXVII 
THE WAVE PROPERTIES OF MICROPARTICLES 


179. Diffraction of Electrons 


Fig. 209 shows X-ray and electron patterns of scattering from the 
same substance. The close similarity of the patterns indicates that 
diffraction also occurs in the case of electrons. If the wavelength 
4 is known, one may determine from a roentgenogram the values of 
the interplanar distances by means of the equation nA = 2d sin 0 


Fig. 209 


(see p. 385). We can calculate 4 by measuring the angle 6 of all the 
tings on the electronogram and using the values of d determined from 
the roentgenogram. The same value is obtained for each ring. This 
shows that the pattern is produced by diffraction and that a specilic 
wavelength is associated with a beam of electrons. 


In order to obtain such a pattern, one must place a thin film of matter in 
the path of an electron beam. Electrons are_easily absorbed by matter and 


472 The Wave Properties of Microparticles 


will not pass through a film the thickness of which is more than about 1075 cm. 
Electronograms may be obtained by means of an electronograph, an instru- 
ment similar to an electron microscope. In fact, one may use an electron micro- 
scope to obtain diffraction electronograms. For this purpose, it is merely 
necessary to remove the lens. 

Electron diffraction is used for the same purposes as X-ray diffraction, but 
electronography has a number of advantages over roentgenography. The main 
advantage is the short exposure time required. Matter scatters electrons much 
more effectively than X-rays. An electronogram may be obtained in a period 


of time measured in seconds, while a roentgenogram- requires at least several 
minutes. 


Since electron beams do not penetrate matter to any extent, electronography 
may be conveniently employed to investigate the structure of surfaces. Elec 
tronography may also be used to study the distribution of atoms in crystals, 
assuming, of course, that the structure is not complex. 


We are not so much interested in the applications of electrono- 
graphy as in electron diffraction itself. It is of great importance to 
determine the wavelength of an electron beam. This may be done 
experimentally. When the accelerating voltage U is varied, the 
wavelength varies. It turns out that % is inversely proportional to 
VU. Thus, when 2 is expressed in Angstroms and U in volts, 


a 
hw 112-25 
U 


As far back as 1924, before the discovery of electron diffraction, 
Louis de Broglie advanced a bold hypothesis. He proposed that the 
concept of the dual nature of an electromagnetic field, its manifesta- 
tion in the form of a wave of frequency v and wavelength A as well 


as in the form of particles (photons) with an energy E = hv and 


a momentum p = uae = 4 , should be broadened to include particles 


of matter. Experiments in electron diffraction confirmed this hy- 
pothesis. The formula for the electron wavelength 


is easily converted into the form 


* 
peer aitedd » where const* =——., 
VU 2em 


: peers 
by means of the relation “> =eU or v= / Zev. 


Substituting the values of e, m and h in the CGS system of units, 
and converting U into volts, we obtain the above value for the con- 
stant. 


180. The Fundamental Concepts of Quantum Mechanics 473. 


By means of the above formula, one can obtain the following electron wave- 
lengths for the indicated values of .accelerating potential: 
U (volts) 1 100 1,000 
y (A) 42.25 1.22 0.39 


Since diffraction [rom a crystal is possible when the wavelength is of the 


same order as the period of the crystal lattice (~1 A), it is seen that electrons. 
the energies of which are of the order of a hundred electron-volts may be diffract- 
ed from a crystal. 


180. The Fundamental Concepts of Quantum Mechanics 


As we have seen, an electron beam may behave as a wave having 
a wavelength 1 = : . Is this behaviour typical of a large group of 
electrons or are wave properties inherent in single electrons? To 
answer this question, let us compare the electronograms obtained 
with a low and a high intensity beam of electrons. In one such exper- 
iment, the beam was of such low intensity that the average time 
interval between two successive passages of an electron through the 
diffracting system was 30,000 times greater than the time taken by 
an electron to travel from the filament to the photographic plate. 
Nevertheless, the diffraction pattern differs in no way from another 
electronogram oblained with a beam the intensity of which was 
10? times greater. Such experiments show that the wave property is 
inherent in single electrons. Thus, what was stated on pp. 438-39. 
with respect to a photon is also valid for an electron. An electron 
does not behave like a projectile or pellet. The behaviour of an 
electron cannot be described by means of Newton’s laws of mechan- 
ics. The field devoted to the investigation of the behaviour of 
microparticles is known as quantum mechanics. 

This dual behaviour is not peculiar to an electron alone. Wave 
behaviour is characteristic of all microparticles. Thus, it is possible, 
for example, to observe neutron diffraction. Diffraction of helium 
atoms and hydrogen molecules have also been observed. As will be 
seen below, the greater the mass ofa particle the more do its wave 
properties recede into the background. But more about this later. 
Let us designate by p the amplitude of the wave associated with 
a microparticle of mass m. This amplitude, like the amplitude of 
a wave of any kind, is a function of coordinates. The function (z, y, 
z) or “psi function”, must satisfy the wave equation (see p. 341): 


Atp + = 0, 


17 
Let us substitute in this equation the microparticle wavelength 


N= me , expressing the velocity of the particle in terms of its energy. 
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If the particle moves in a field with a potential energy U and has 
a total energy @, then 


mv2 


5-=6—U and v= 2-0). 


Substituting in the wave equation, we obtain 


822m 


Ap +—s- (6 —U) p=0. 


This equation is called Schrédinger’s equation and is the fundamental 
law of quantum mechanics. It should be noted that we have merely 
performed a substitution in the wave equation, which by no means 
constitutes a derivation of the fundamental law of quantum mechan- 
ics. The substitution should be viewed rather as a conjecture leading 
to the discovery of this equation. 

Much like Newton’s fundamental law of motion, the Schrédinger 
equation is a law of nature encompassing, as we shall presently see, 
an extensive range of phenomena.* 

This differential equation enables us, in principle, to find at all 
points in the region under consideration the amplitude p (x, y, 2) of 
the wave associated with a microparticle. 

How can the validity of the Schrédinger equation be verified? 
This may be done by means of a basic confirmation of the theory: 
the intensity of the p-wave at every point in space, i.e., the quantity 
‘p’, is the probability density** of an electron at this point is space. 
As for the amplitude of the tp-wave, it cannot be determinéd experi- 
mentally, like the field intensity vectors of an electromagnetic wave. 

The description of a particle by the y-function is not to be regarded 
as an incomplete, imperfect method of describing its motion. It 
would be incorrect to believe that it is solely due to the peculiarities 
of quantum mechanics that a particle has a probability w? (z, y, z) 
of being at a given point in space and that by means of a better 
theory the path of the particle could be determined and its location 
indicated with certainty. An exact description of the motion of 
a particle, i.e., the determination of its path, is not possible because 
the behaviour of a microparticle is completely different from that 
of a large body. A microparticle is not a particle in the classical 
sense. 

Let us again refer to the experiment with two slits which was used 
to illustrate the dual behaviour of a photon. 


* We have simplified the picture by not considering the dependence of p 
on time. The exact Schrédinger equation takes this dependence into account. 
** That is, the probability of finding the particle in a small volume divid- 

ed by the magnitude of this volume. 
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Assume that a beam of electrons impinges on a screen in which 
two slits are cut. Diffraction occurs. Let us direct our attention to 
one point on the screen where, say, interference annuls the wave. 
If only one slit is kept open, electrons reach this point. If both slits 
are open, however, electrons do not reach this point. This phenome- 
non cannot be explained by the collective behaviour of the electrons. 

If we insist on the electron’s behaviour as a classical particle, it 
must be accepted that an electron passing through one slit “knows” 
whether the other slit is open or closed, and behaves’ accordingly. 
It must be concluded that an electron is not a classical particle. 
Every electron has wave properties and the concept of an electron 
path is not fully applicable. Therefore, the question whether the 
electron passed through one slit or the other when both slits were 
open is meaningless. Such a question is meaningful only for “ordi- 
nary” particles, but not for microparticles. 

Does this mean that it is meaningless to speak of the velocity and 
coordinates of a microparticle? This question is answered by the 
so-called indeterminacy (or uncertainty) principle formulated by 
the German physicist Heisenberg. 
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This principle indicates the limits within which the classical 
description of ‘particles applies to microparticles. 

Applicability of the Path Concept. Let us assume that we wish 
to determine the coordinate of a microparticle at. some point x and 
are able to do this with an accuracy of Az. To “see” a microparticle, 
one must use a “microscope” in which the wavelength of light is 
sufficiently short (the shorter the wavelength the greater the resolv- 
ing power). In principle, Az may be made as small as desired. For 
this purpose, it is merely necessary to reduce the wavelength until 
Az is of the same order of magnitude as the wavelength: Az ~ i. 


However, if the wavelength is short, this means that the corre- 
. I . 
sponding photon has a large momentum: p =z: This momentum 


will be transmitted to the particle being “observed” under the micro- 
scope, i.e., when the particle is “flicked” its momentum changes 


by a Ap the order of magnitude of which is + . By decreasing 4, we 


decrease Az, the uncertainty of the coordinate, but at the same time 
we increase Ap, the uncertainty of the momentum. Eliminat- 


ing 4 from the following relations: Ax ~ 4 and Ap = us , we obtain 
the equation 
Az x Apxh, 
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the expression for the uncertainty principle. This indicates that spec- 
ifying the path of a microparticle has physical meaning only if il 
is understood that the coordinates and momentum in a given direc- 
tion have uncertainties which satisfy the inequality 


Az x Ap>h. 


This remarkable relation establishes the limits of applicability of 
classical physics to a microparticle. 

Substituting the product mv for the momentum p, which is justi- 
fied in the case of velocities which are not very close to the velocity 
of light, we obtain the condition 


Az x Av> ge ; 
: m 
a relation between the uncertainties in a coordinate and the corre- 
sponding particle velocity along the z-axis. The right member of the 


relation will vary by many orders of magnitude, depending on 


Hera we are dealing with an electron, atom, molecule or tennis 
all. , 


For an electron 
h 


a 7 cm? /sec, 


hence, the uncertainties in a coordinate and the 
ticle velocity are related as follows: 
Az X Av>7. 

Consider an electron located within an atom, the diameter of which 
is 10-§ cm. Can the motion of the electron in the atom he described 
as if the electron were an “ordinary” particle? Using the uncertainty 
principle, we find that Av ~ 108 cm/sec. Thus, we can only speak of 
the velocity of an atomic electron in very general terms. The con- 
cepts of an electron path in an atom, an electron path of transition 
from one energy state to another (see below), etc., are meaningless. 
In short, an atomic electron is quite different from an “ordinary” 
particle. 

Now, let us assume that an electron has entered a Wilson cloud 
chamber and that we wish to trace its path with an accuracy of the 
. order of several tenths of a millimetre. If the width of the particle 
track is Az = 10-* cm, then according to the uncertainty principle 
Av =~ 7 m/sec. This is the uncertainty of the lateral component of 
the velocity. Even if the electron velocity is only 1 km/sec an uncer- 
tainty of the indicated order is insignificant and specifying the 
electron path becomes meaningful. Similarly, we can speak of the 
path of an electron beam in a microscope and the path of electrons 
in an electron-beam tube without coming into conflict with the 
“classical” picture. 


corresponding par- 
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The mass of protons, neutrons, atomic nuclei and atoms is thou- 
sands of times greater than that of an electron. Therefore, the classi- 
cal paths of such particles are of somewhat greater usefulness. Thus, 
for example, in the case of an a-particle, the mass of which is about 
7,000 times greater than the mass of an electron, 


Az xX Av > 10°3. 


ls it meaningful to ask at what location in an atom did the path 
of an q@-particle penetrating a substance pass through the atom? 
We wish to trace the path with an accuracy of 10-° cm and we have 
at our disposal data on the Jateral component of the velocity the 
uncertainty of which is 10° cm/sec = 10 km/sec. For a fast a-particle 
(20,000 km/sec), this uncertainty is insignificant. Therefore, it is 
possible to say whether the path of the a-particle penetrating the 
atom passes far from the centre of the atom or not. 

On the other hand, it is meaningless to speak of the path of pro- 
tons or neutrons in nuclei, since the size of a nucleus is ~ 10-¥ cm. 

For large molecules, e.g., proteins having a molecular weight of 
the order of 10%, the uncertainty principle is of no significance. 
Here, Ax x Av > 10-7; hence one can reliably define the path of 
such a molecule in considerable detail. Even the random thermal 
motion of such a molecule, the average velocity of which is of the 
order of 1 m/sec, may be traced, and the path of its centre of gravity 
indicated with an accuracy of the order of 41 A. 

Needless to say, a particle of dust, even if it is visible only under 
a microscope, is too large for the uncertainty principle to be of prac- 
tical significance. 

The Simultaneous Measurement of Two Physical Quantities. 
It should not be thought that the inability to determine the path 
of a particle is due to a measuring deficiency which will eventually 
be overcome by physicists. The lack of meaning in specifying a pair 
of physical quantities with ideal exactness is a peculiarity of micro- 
particles. The methods of describing the behaviour of an “ordinary” 
particle are inapplicable to a microparticle. Only for a classical par- 
ticle is it meaningful simultaneously to specify and define its coordi- 
nate and momentum. 

The. principle of uncertainty has broader significance than being 
simply a means of judging whether or not the path of a particle 
can be determined. As an integral part of the mathematical appara- 
tus of quantum mechanics, the principle of uncertainty enables us 
to evaluate the possibilities of simultaneous measurement of any 
physical quantities, not only coordinate and momentum. 

First, let us define the uncertainty principle in relation to coordi- 
nate and momentum. It should be recalled that a particle has three 
coordinates and that a momentum vector has three components. 
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Instead of the one relation discussed above, three relations should 
be written, namely: 


Az xX Apz>h; Ay x Apy>h; Az x Ap, >h. 


The possibility of simultaneously determining (specifying) all the 
coordinates along the different axes, and all the momentum compo- 
nents, is also considered in quantum mechanics. It turns out that it 
is possible (meaningful) simultaneously to specify the coordinates 
or simultaneously specify all three momentum components. Why 
is this point being emphasised? It would seem that it is always pos- 
sible simultaneously to determine the three components of any 
vector. Careful consideration indicates that this is nol so. An exam- 
ple of a vector the three components of which cannot be determined 
simultaneously is the angular momentum (i.e., moment of momen- 
tum) of a particle. 

Assume that a particle rotates about an axis with an angular mo- 
mentum L. This motion may be viewed as the resultant of three 
rotations about three mutually perpendicular axes with angular 
moments L,, L, and L,. ln the case of an “ordinary” particle, the 
three components of the angular momentum may be determined 
separately since the path of the particle may be traced. In the case 
of a microparticle such a determination is not possible, and simul- 
taneous specification of all three components of the angular momen- 
tum is meaningless. To clarify this point, let us assume the converse 
for a moment, namely, that all three components of the angular 
momentum are known. But then the total angular momentum could 
be constructed from the three components. In that case, the plane 
in which the particle moves is determined. But if this plane is 
known, then we know precisely the coordinate of the particle along 
the axis of rotation and note simultaneously that the velocity of 
translatory motion along the axis of rotation is equal to zero. This 
contradicts the principle of uncertainty relating coordinate and 
momentum. 

Thus, it is characteristic of a microparticle that it is not possible 
simultaneously to determine the three components of its angular 
momentum. What data relative to its rotation may be specified 
simultaneously? The uncertainty principle gives the following 
answer: any component and the absolute value (vector length) of the 
angular momentum. There is one exception to this rule: complete 
absence of rotation may be established for a microparticle, i.e., the 
angular momentum vector may equal zero; in other words, all three 
components equal zero simultaneously. “ 

Energy and Time ‘Interval. On the basis of the uncertainty prin- 
ciple relating the coordinate and momentum of a particle, one may 
suspect that a more or less analogous relation which involves energy 


i 
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exists. Thus, in ordinary particle mechanics, it was necessary to 
know simultaneously the position and velocity of a particle in order 
to calculate the total energy as the sum of potential and kinetic 
energy. For a microparticle this is not possible. However, the total 
energy of a particle may be found as a whole, i.e., without separa- 
tion into parts, and on the basis of what was just said it is natural 
to expect that this may be done with an uncertainty A@. If it is 
assumed that the uncertainty principle maintains its form, then 
from dimensionality considerations we must conclude that the uncer- 
tainty relation for energy must have the following form: 


AgAt <= h 


where At is the time interval. 

What is the significance of this time interval and how should the 
uncertainty relation for energy be interpreted? The time interval 
At is the time during which a microparticle possesses an energy 
G + A&. The uncertainty in the energy of a microparticle is deter- 
mined by the time during which it is in the given energy state. The 
uncertainty in energy becomes significant only when the time during 
which it is at the given energy level is measured in minute fractions 
of a second. 

An atomic electron remains for an indefinitely long period of time 
at its lowest, or fundamental, energy level (see below for a detailed 
discussion). Therefore, the energy of the fundamental state is fixed 
quite regidly. An electron remains for a very short period of time 
at a higher level. Its energy in such a state is G + A @. Accordingly, 
when an atom passes from a higher energy level to a lower one, the 
radiation frequency cannot be exactly defined, i.e., it lies in the 


narrow band v =: a6 . This may be observed experimentally: spectral 


lines are of finite width, which may be. used to determine the so- 
called lifetime of a microsystem in an excited state. Experiments show, 
for example, that the width of spectral lines in the X-ray region 
is of the order of 10 ev. Thus, in such a case, the lifetime in an excit- 


I 
ed state is of the order of aE 10-15 sec. 


182. The Potential Square Well 


On page 57, we discussed potential curves, which clegtt 
the conditions of particle motion. The simplest g¥¥aa\t 
is a right-angled curve, a so-called square well. Yi Satc 
potential energy has a constant value over a seg 
ity, we restrict ourselves to the linear case). | 
well, the potential energy changes abruptly. if 
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high, the potential energy inside the well may be considered equal 
to zero (since the choice of origin is immaterial), and at the edges 
of the well equal to infinity (see Fig. 210). 

Assume that an electron (or some other particle) is located in the 
well. Let us try to determine the nature of its motion for the simple 
one-dimensional case, i.e., let us assume that the electron is moving 
along the x-axis. If Newton’s laws of mechanics were applicable to 
an electron, such an electron would 
move continuously—lirst toward one 
side of the well, where it would be elas- 
tically reflected from the wall, then 
toward the other side, etc. There is no 
other possibility from the viewpoint 
of Newtonian mechanics, since for 


U = 0 the kinetic energy = is con- 


stant. Thus, for motion in a square 
well, the following conclusion may be 
drawn from the mechanics of “ordinary” 
particles: A particle may move in the 


- J . 2 
well with any kinetic energy or 


it may remain motionless. For any 
tgiven energy, the motion is uniform— 
first toward one side, then oward the other, i.e., the velocity re- 


verses direction abruptly at the end of the allowed interval. 

Now, let us consider the electron motion in such a well from the 
viewpoint of quantum mechanics. 

The behaviour of the electron is characterised by the »p-function. 
The square of this function indicates the probability of finding the 
electron at some point in the given interval. Since U = 0 inside 
the well, the Schrédinger equation becomes simplified and may be 
written in the form 


dp) | 472 
dx? + 42 


Fig. 240 


h 
amE ” 
This equation is satisfied by the sine and cosine of the argument 


— If the well is bounded by the coordinates = 0 and x =a, 


then for these values p = 0, since the electron does not penetrate 


the walls. Therefore, cos 2a = is not suitable as a solution to the 


equation (cos 20 = = 1 At gS 0). Hence, 


Y= 0, where \= 


Pa 
p= Asin =z. 
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But the wavelength 4 cannot be arbilrary, since p= 0 at x= a. 
The following equation must, therefore, be satisfied: 


2n 


a (n+1)x 


or 


2 

has? where n=O, 1, 2,.... 

Thus, the wavelength may assume the values 2a, a, a : + } baa 
It is seen that the w-function represents the amplitude of a standing 
wave (see p. 132) and that formally this problem has much in com- 
mon with that of a vibrating rod or string. But if the wavelength 
i has a discrete set of values, then the energy @ of a microparticle 
cannot be arbitrary, i.e., 


(n+ 12h? 
6= 8ma2 ? 


the integer n is called the quantum number. 

Thus, the Schrédinger equation leads to the quantisation of energy. 
A microparticle in a square well has a discrete set of energy levels. 
The lowest energy level occurs for n = 0. This energy is equal to 
a and is the zero-point energy of a particle located in a square 
well. 

The existence of a zero-point energy is an interesting peculiarily 
of microparticles. In the case of “ordinary” particles, the lowest 
energy has a value of zero. But under no circumstances can micro- 
particles cease to move. At absolule-zero temperature, a micro- 
particle has a specific zero-point energy, the values of which differ 
considerably depending on the nature of the force field in which 
the particle is located. 


Example. Assume that a=1A (a characteristic value for an atomic 
region). Then, the zero-point energy of an electron in the square well is 


4 + {0Q-27)2 
je ONE O10 Wore ar wi 


If a=1 cm (a [ree electron in a picce of metal), @9=0.6X10-26 erg =37x 
xX 10-16 ev. 


The velocity of an electron at a given energy level may be calculat- 
ed by means of the wavelength: =. But in such a case, the 
electron motion cannot he described by the equations of classical 


mechanics. It is not possible to indicate where the electron is located 
31-1409 
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at one or another instant of time. However, the quantily w?, i.e., 
the probability density of the electron at one or another point in 
space, may be determined from the equation 
Hh = AP sin? = 2 = AP sin? =(n+1) ce 
It should be noted that to each energy level there corresponds 
a proper (of same number n) wave function (also called charactcris- 
tic function or eigenfunction). 


Fig. 211 shows the w-function and w?-funclion for the first four 
energy levels of an electron located in a square well. Quantum 


pz) 


Fig. 211 


mechanics leads to the conclusion that an electron does not appear 
with equal frequency at different points in the region. If the electron 
energy is a minimum, i.e., the electron is located at the lowest 
level (x = 0), the particle is most often encountered at the centre 
of the “well”; if the electron is located in the state corresponding 
to n = 1, it is never encountered at the centre of the allowed seg- 
ment, etc. The p; curves indicate the frequency with which the 
electron appears at various points in the region. 

Now, let us summarise. On the basis of quantum mechanics, the 
following conclusions may be drawn regarding the motion of a mi- 
croparticle in a square well. Only motion corresponding to a discrete 
set of energy values, @o, 61, G2 ..-, is possible. The particle can- 
not be stationary since even the lowest energy level corresponds to 
motion with a certain velocity. Data on the nature of particle mo- 
tion for a given energy are provided by the p*-function. Knowing 
ap? (z), one may determine at what points in the region the micro- 
particle appears often, and at what points rarely. 
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lt remains to be determined under what condilions an “ordinary”, 
i.e., classical, description of particle behaviour is valid. 

Imagine that an oxygen molecule is enclosed in a box the dimen- 
sions of which are scores of times greater than the dimensions of the 
molecule. Assume that the molecule has the average energy, at room 
temperature, of a, molecule of oxygen gas, i.e., 107" erg. Using the 
values a = 100 A, m= 5.4 x 10° gm and ¢ = 10-™ erg, one 
may determine the quantum level of the microparticle. The result 
is n = 1,000. Two conclusions may be drawn. First, .the 1, curve 
has such a large number of alternating maxima and minima that 
the probability density of the particle is approximately the same 
for all points in the box. Secondly, neighbouring energy levels are 
very close. 

Both characteristics which follow from the fundamental equation 
of quantum mechanics have disappeared: the probability distribu- 
tion of the particle is practically indistinguishable from a smooth 
curve and the energy levels are so close that energy discreteness is 
imperceptible. In such a case, quantum mechanics yields approxi-. 
mately the same results as particle mechanics. This is true whenever 
the particle energy corresponds to a large quantum number. We 
have illustrated an important principle of quantum -mechanics: 
when the quantum number is large, the results of quantum mechan- 
ics coincide with those of the mechanics of “ordinary” particles. 
This means that when 7 is large the particle-path concept and other 
characteristics of ordinary particles are applicable to microparticles 
as well. 


183. Significance of the Solution of the Schrodinger 
Equation 


We have devoted a relatively large amount of space to the motion 
of a particle in a square well. On the basis of this simple example, 
we were able to illustrate the basic features of the quantum-mechan- 
ical method of considering problems. If an electron or other particle 
is able to execute motion in a limited region, the characteristic 
features of the solution of the Schrédinger equation are preserved, 
no matter what the form of the potential curve in this region. In 
all cases, the potential well may be intersected by a number of hori- 
zontal lines—possible energy levels. In principle, the Schrédinger 
equation enables us to calculate these energy values if the form of 
the potential well is given. The lowest level gives the zero-point 
energy of a particle in a given potential well. 

For each energy level of number n, quantum mechanics establishes 
a set of wave functions tp, (x, y, 2). The quantity pj, (x, y, 2) gives 
the probability of finding a particle at a given point in the region if 

31* 


\ 
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the energy of the particle is @,. Since the particle manages lo be at 
all points in the region more than once during the time of meas- 
urement, 1p? (z, y, 2) may be viewed as the density of the “particle 
cloud”. The electron cloud surrounding an atomic nucleus resembles 
a photograph of the atom taken with long time-exposure. The p-func- 
tion gives the amplitude of the wave associated with the particle. 
In the case of an electron in a square well, the p-function consists 
of standing waves and to every level there corresponds a character- 
istic wavelength 2X. 

This is not the situation in the general case. The standing “waves” 
corresponding toa given state (given n) will appear very strange: 
their wavelengths %——— + will differ at different points 
V 2m(g—U) 
in the region, depending on the nature of the potential “curve” 
U (zx, y, 2). For more or less complex cases, there is only slight simi- 
larity between the wp-function and the amplitude of a standing wave 
(in the usual sense of the term). 

Theoretical and experimental evidence indicate that in a number 
of cases several 1p,-functions may correspond to a single ¢, energy 
value. This occurs if at one energy a particle may have states which 
differ with respect to another physical quantity, e.g., angular mo- 
mentum. The forms of the wp? clouds of such a state—called a degen- 
erate state—may differ radically from one another. 

The problem of particle motion in a given type of potential well 
is solved when the energy levels are found and the characteristic 
ip-functions are calculated for all levels. If the solution of the Schré- 
dinger equation is known, the result of one or another measurement 
performed on the given particle may be predicted. 


184. Tunnelling Through a Barrier 


We shall now discuss a peculiar effect which may occur in the case 
of a microparticle, but not in the case of an ordinary particle. This 
is the tunnel effect, i.e., the “leakage” of a particle through a poten- 
tial barrier. 

Imagine that inside the region in which a particle moves, there 
is a potential barrier of height U and width d (Fig. 242). If the ener- 
gy of the particle is 6 < U an ordinary particle could be either in 

front of the barrier or beyond the barrier. The particle cannot pass 
through the barrier, since this would require that the particle have 
a negative kinetic energy and an imaginary velocity, which is absurd. 
Matters stand differently with respect to microparticles. The uncer- 
tainty principle does not permit us simultaneously to ascribe to 
a microparticle exact values of velocity and coordinate, and hence 


x 
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of kinetic and potential energy. That is why a particle having a total 
energy & may pass through the barrier. 

The conditions for such passage may be determined as follows. 
Coordinate and momentum uncertainties are connected by the’ rela- 
tion AzAp~h. Momentum uncertainty 
is uniquely related to kinetic energy 


uncertainty since K = se If AK isa 
quantity of the order of U — &, where 
& is the particle energy and U is the 
height of the barrier a particle to the 
left of the barrier (see figure) has a co- 


ordinate uncertainty 

bk 

Ap = V/2m (U—@) i 

If the barrier width d is less than Az, 
the particle may be located on the 
other side of the barrier. The particle tunnels its way, so to speak, 


through the barrier at the total energy level @. 
Thus, the condition for tunnelling is 


dV 2m (0—é)<h, i.e., 4 V 2m(U—@6@) <1. 


It may be easily shown by numerical examples that the phenomenon 
is of significance only for microparticles. 


For U—€=10 ev~10-" erg, m~ 10-2? gm ‘(electron mass) and d~ 


Ar~ 


~ 10-8 cm, £ V/2m (U—@) =0.2<1, i. e., tunnelling is possible. 

For a i-gm spherule lying next to a match box (U—'6 =3,000 ergs and 
d=2 cm), Vim Ua) =2.5x10 > 1. It is evident that the spherule 
cannot “tunnel” through the match box. 


The probability of leakage through a barrier may be calculated. 
It turns out that this probability is proportional to 


_ iz V2m(U—&)a 


The tunnel effect could be predicted on the basis of the Schré- 
dinger equation. The solution of this equation shows that even at 
points where U > & the »p-function has values differing from zero. 
Thus, there is a certain probability—inversely proportional to the 
magnitude of U — ¢ —that an electron is located in a region where 
in the language of “ordinary” particles it possesses negative kinetic 
energy. 


CHAPTER XXVIII 
ATOMIC STRUCTURE 


185. Energy Levels of a Hydrogen Atom 


A hydrogen atom has one electron which “rotates” in a nuclear 
field. One would think that the problem is simple. But even for this 
most simple atom, the solution of the Schrédinger equation is very 

é cumbersome and, therefore, 
will not be reproduced here. 
However, we shall give the 
, results of the calculations 
and discuss them in con- 
siderable detail. 

An electric force of Cou- 
lomb attraction acts  be- 
tween the electron and the 
nucleus. The potential ener- 
gy of an electron ina nu- 


clear field is u=—= 


where e is the charge of an 
Fig. 213 electron (the same as the 
charge of a proton) and 
r=Vxr2+ y> + 2" is the distance between the electron and 
the nucleus. 
The Schrédinger equation has the form 


’ 


8n2m 


; Ap + h2 (e+=) peu: 


Such an atom constitutes a peculiar kind of potential well and is 
illustrated in Fig. 213. This is a well without a bottom and with 
divergent sides. The sides of the well are hyperbolas and the axis 
r = 0 is one of the asymptotes. The electron inside the atom has 
a negative potential energy* since the minimum value of potential 


* It may be asked: why has the origin of potential energy been chosen in 
such a manner that the electron energy is negative? The advantage of such 
achoice is not difficult to see. For different atoms, the potential energy has the 
same value only when r — co. It is natural to set this common value equal lo 
zero. 


4 
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energy tends to infinity when r+ 0O and the maximum value is 
equal to zero. : 

Fig. 214 shows the energy levels obtained from the solution of the 
Schrédinger equation. An important feature of the solution is the 
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Fig. 214 


drawing together of the levels as the quantum number 7 increases. 
Transitions between levels will be discussed below. The scales of 
values, which are proportional to energy, are given in the units 
adopted in spectroscopy: volts and reciprocal centimetres. The energy 
level formula may be written in the form 


Er= 22me4 
On ~~ RRnd 


488 Atomic Structure 


For historical reasons, it is customary to write this formula in the 
form 
cRh 
6r= a ? 


n2 


2 
= = 109,740 cm-! is the Rydberg constant. 

Thus, it transpires that the total energy as well as the potential 
energy of the atomic electron is negative. The atomic electron may 
be located at any one of n levels. The greater the value of'n, the 
higher the energy level, and the greater the energy possessed by the 
electron. The electron of a free hydrogen atom on which no force 
acts is at the lowest energy level: G; = — cRh. 

If an energy greater in magnitude than cRh is transmitted to the 
atom the electron leaves the bounds of the potential well, i.e., the 
atom becomes ionised. The energy cRh is called the ionisation energy: 

It is customary to characterise the work of tearing away an elec- 
tron from an atom by the ionisation potential. For a hydrogen atom, 


V,,, — shh 3x 1010 x 109,740 x 6.6 x 10-27 
ela el 4.8 X 10-10 rs 


= 4.5 x 10°? statvolt = 13.5 volts. 


The reason for the above designation is the following. Let us as- 
sume that the electron is torn away from the hydrogen atom by the 
action of a beam of electrons. To ionise a hydrogen atom, one must 
accelerate electrons, which act as projectiles, to an energy of at 
least eV = cRh. Therefore, V is the potential difference through 
which an electron must be accelerated in order to produce ionisation 
of collision with a hydrogen atom. 

If the energy imparted to a hydrogen atom is less than cRh, 
a transition of the atom occurs to one of the n levels*. Such an atom 
is said to be in an excited state. 

An atom stays in an excited state for a small fraction of a second 
and then passes to a lower level with the emission of a photon in 
accordance with the equation 

hVmn = 6m—En=cRh (= =r) ‘ 


n2 m2 


where R = 


If hydrogen atoms are excited by different kinds of collisions, they 
are raised to different energy levels and return to the ground state 
by “skipping” over levels (see Fig. 214). Therefore, a large concentra- 
tion of hydrogen atoms will radiate photons of every possible Vmn 
frequency. A characteristic line spectrum of emission arises. 


* In the case of a hydrogen atom, which has one electron, the phrases “the 
atom is at an energy level n” and “the electron is at an energy level n” have 
the same meaning. 
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By calculating for a given m the v,, frequencies corresponding to 
the numbers 7 = n+ 1,n+2,... , weobtaina series of frequen- 
cies of lines in the hydrogen spectrum. The existence of such series 
was known long before quantum mechanics was developed. The 


Ml 


Fig. 215 


series corresponding to x = 2 is known as the Balmer series. By sub-: 
stituting m = 3, 4, 5, 6, 7 and 8, respectively, in the formula, let 
us calculate the wavelengths of six of the lines,in this series: 4; = 
= 6,562.80 A; A, = 4,861.38 A; A, = 4,340.51 A; 4, = 4,101.78 A; 
Az = 3,970.11 A: and Ag = 3,889.09 A. It is apparent that the 
separation between lines decreases as 7m increases, which is in accord- 
ance with experimental results (see Fig. 245). Experimental and 


calculated values do not differ by more than 0.05 A. 


186. Quantum Numbers 


The solution of the Schrédinger equation enables us to determine 
all of a hydrogen atom’s energy levels, €n, as well as all of its wave 
functions. In the ground state, an electron is characterised only 
by the function ,. As for the excited states, they are degenerate to 
the square of m, to use the terminology of quantum mechanics. This 
means that there are four p-functions corresponding to the energy 
G2, nine corresponding to @3, etc. Eeach of these states may actual- 
ly exist. 

How do the n? states having the same quantum number n differ 
from one another? Quantum mechanics provides the answer to this 
question. States with one and the same energy value &, may differ 
with respect to the magnitude of the electron’s angular momentum 
as well as the value of the angular momentuin’s projection on a cer- 
tain selected axis. 

The solution of the Schrédinger equation for a hydrogen atom 
shows that the electron’s angular momentum has a discrete series of 
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values given by the formula 
L=Vii+)>, 


where J may assume any integral value from 0 to 2 — 1 when the 
electron is at the n-th level. 

Moreover, the Schrédinger equation shows that relative to the 
selected direction z the angular moment Z must be oriented in such 
a manner that 

L, => mo . 
where m is an integer that may assume any value from —1 to -+J, 
including zero. 

It should be recalled that according to the uncertainty principle 
L and L, give us all that we can possibly know about the angular 
momentum; in other words, it is meaningful to specify simultane- 
ously only these two quantities. 

Thus, the state of an electron in an atom is characterised by three 
quantum numbers: n, 1 and m. The number n is called the principal 
quantum number, / the reduced azimuthal quantum number, and 
m the magnetic quantum number. 

The states with 1 = 0, 1, 2, 3, ... are designated by the letters 
s, p, d, f, ... , respectively. The principal quantum number pre- 
cedes one of the above letters. For example, the 3p state is the state 
with n = 3 and / = 1. 

Let us list all of the possible states for x = 1, 2 and 3: 


n | l 


Designation 


of the state _ 
1 0 1s | 0 
2 0 2s 0) 
4 2p —1, 0, 1 
3 0 3s 0 
1 3p —-1, 0,1 

2 3d —2, —1, 0, 1, 2 


The energy transitions of a hydrogen atom are determined exclu- 
sively by the values of the principal quantum number n. In order 
for the J and m numbers to play a part, the degeneracy must be 
“removed”, i.e., the energy of states with a different angular momen- 
tum must be changed. In the case of hydrogen atoms, this may be 
done by placing the atoms in a magnetic field. In other cases, de- 
generacy is removed by electron interaction (see below). 
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187. The Electron Cloud of s and p States 


The state characterised by the three numbers x, 1 and m is de- 
scribed by the wave function tpn, 7, ,- The characteristic form of the 
electron cloud corresponding to this state is determined by the func- 
tion 3,1, m. Let us consider the form of the tp*-functions of a hydro- 
gen atom which characterise its various excitation states. 

Consider the s states. Since 7 = 0, m is also equal to zero. Hence, 
for every 7 there is only one wp-function: The equation / = 0 means 


4nr?y? 


1 2 3 
Fig. 216 


that the electron has no angular momentum. This requires, of course, 
that there be no favoured directions of motion, i.e., the electron 
cloud must have spherical symmetry. Such is the result obtained 
from the Schrédinger equation: the functions ‘p15, P2s, Pas, etc., 
have spherical symmetry. 

Fig. 216 shows curves of radial density distribution of the electron 
cloud or, what amounts to the same, the probability density distri- 
bution of the electron. The quantity 4ar%p?, which gives the radial 
density, is plotted along the ordinate. It is evident that 4mr*p*dr 
represents the number of electrons* contained in a spherical shell 
the inner and outer radii of which are r and r + dr, respectively. 
The radial density curves show that in the 1s state there is one 
electron density maximum, which for a hydrogen atom is located 


* A fractional number of electrons should not disturb us, for this is only a 
manner of speech. Strictly speaking, 4r%ip? dr is the probability of an clec- 
tron being inside a spherical shell of dr thickness. 
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at a distance of 0.53 A from the nucleus. In the 2s state, there are 
two density maxima; the electron will be within the second mavxi- 
mum most of the time. Finally, in the 3s state, there are three den- 
sity maxima; here, the electron will be within the third maximum 
most of the time. 

As the principal quantum number n increases, the electron cloud 
becomes dissipated. ' 

The p-state functions look entirely different. There are three 
values of m, namely,“0, —1 and +4, corresponding to 1 = 1. The 


m=] : =+7 


Fig. 247 


electron-cloud configurations are illustrated in Fig. 217. For m = 0, 
the major axis of the “figure eight” is oriented along the selected di- 
rection; form = + 1, the major axis is perpendicular to the selected 
direction. It is evident that the m = + 1 states may be meaning- 
fully distinguished only when both are present. The figure gives 
some indication of the symmetry of the electron cloud. It is the 
same for all states. A change in the principal quantum number 
merely results in a change in the nature of the radial drop in den- 
sity: the greater the value of nm, the more extended the curve. 

We shall not discuss states with large values of J. Their electron 
clouds are more complex. 


188. Pauli’s Exclusion Principle 


Atoms are arranged in the Mendeleyev periodic table in accord- 
ance with the number of electrons contained in them. Thus, helium 
has two electrons, lithium three and beryllium four. On the basis 
of the Schrédinger equation, what can be said about the structure 
of atoms? 

At first glance the problem may appear hopeless. Even in the 
case of helium, strict adherence to procedure would involve solving 
the Schrédinger equation for a wave function with six variables, 
W (%1, Ys, 21, Le, Yo, Z2), the square of which gives us the probability 
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of finding the first electron at point 2, ys, 2, when the second electron 
is al point 22, Y2, Zs. The potential energy to be substituted in the 
equation is 


where 7, and rz are the distances of the electrons from the nucleus 
{the nuclear charge of helium is 2e) and ry2 is the distance between 
electrons. An exact solution of such a problem is quite impossible. 

It would be extremely desirable to deal separately with each 
atomic electron and describe each such electron by its wave func- 
tion wp (x, y, z). But how can this be done? Evidently, it is necessary 
to consider the motion of one electron in the field of the nucleus 
and the remaining electrons. It may be assumed that this effective 
field has spherical symmetry. Therefore, the description of the 
properties of such an electron will not differ from that of the electron 
of a hydrogen atom. 

To be sure, the problem is still quite difficult: for different elec- 
trons these effective fields differ and, moreover, all must be deter- 
mined simultaneously since each depends on the states of the 
remaining electrons. Such an effective field is said to be self-con- 
sistent. This approach to the problem of a multi-electron atom 
enables us to apply, to a large extent, the description of the prop- 
erlics of a hydrogen atom’s electron to the behaviour of an electron 
of a complex atom. ; 

The state of each electron will be characterised by the same quan- 
tum numbers as in the case of hydrogen. However, in the case of 
a multi-electron atom, degeneracy is removed by electron interaction, 
and levels with different / and m values will have different energies. 

The Schrédinger equation enables us to determine the energy 
levels that are possible, but does not indicate the energy of the 
atomic electrons. One might think that all the electrons of an atom 
occupy the lowest energy level. In any case, such would be the 
behaviour of “ordinary” particles. But experiments completely refute 
such a supposition. The “arrangement” of electrons in accordance 
with energy levels is governed by the Pauli exclusion principle. 
The first conjecture that such a principle exists was based on a study 
of the Mendeleyev periodic table. 

As was indicated above, it follows from the Schrédinger equation 
that (21 + 1) states exist for a given n and Jl. It was also indicated 
that this in turn yields n? different p-functions for a single value 
of n. The first values of n? are 1, 4 and 9. Let us examine the Men- 
deleyev periodic table. Helium, neon and argon, which complete 
the first three periods of the table, contain 2, 8 and 18, i.e., 2n?, 
electrons, respectively. This is by no means accidental. It is an 
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expression of a profound law according to which only two electrons 
can have the same y-function. In other words, an energy level cannot 
be occupied by more than two electrons. This general law of nature, 
to which we shall return in Sec. 190, is called the Pauli exclusion 
principle. 

By means of this principle, we can “arrange” the electrons of a 
complex atom in accordance with quantum numbers and, therefore, 
in accordance with energy levels and values of angular momentum. 
A helium atom has two electrons, which may occupy the single 1s 
level. The third electron of lithium must be located at the next 
level, i.e., the 2s level. A beryllium atom has four electrons, which 
occupy the 1s and 2s levels. The fifth electron of boron is at the 2p 
level. At this level, there are six places for electrons, which are all 
filled when neon is reached. But let us postpone consideration of 
the relation of the Mendeleyev periodic law to the electron structure 
of an atom until Sec. 192. 

Do two electrons occupying a level which is characterised by the 
same three quantum numbers differ in any way? It turns out that 
two such electrons differ with respect to the orientation of their 
internal angular momentum (“spin” orientation). 


189. Deflection of an Atomic Beam in a Magnetic Field 


In the preceding articles, the angular momentum of an electron 
due to its motion about a nucleus was discussed in considerable 
detail. The presence of such angular momentum may be demonstrat_ 
ed since an atom acquires a magnetic moment as a result of the 
motion of its electron. 

Let us employ classical concepts and assume that the electron 
revolves in a circle of radius r. Since current is equal to the charge 
transferred in a unit time, the equivalent current of such a revolving 
electron is J = ne, where 7 is the number of revolutions per second. 


On the other hand, n =s) where v is the velocity. Thus, 


ve 


~~ Qur 
and the magnetic moment of the revolving electron (see p. 263) is 


M=—IS=+ a pe eur. 


c 2mr 2c 
L, the angular momentum of an electron, is equal to mur. Hence, 
é 
M = 2me 


This relationship between the angular momentum and the magnetic 
momentum of an electron moving about a nucleus, obtained by 
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means of the above simple calculations, has been exprimentally 
confirmed for all atomic electrons. 

Thus, atoms for which Z = 0 possess magnetic moments, and it 
may be demonstrated in a suitable experiment that such atoms 
behave like small magnets. Fig. 218 illustrates such an experiment 
in which a parallel beam of atoms passes through a nonuniform 
magnetic field. 

Air is carefully pumped out of a long tube. At the left end, an 
atomic gas is created. Atoms in the left compartment can get out 


Fig. 248 


through small apertures in the screens. Since there are two aper- 
tures, only atoms moving along the axis of the apparatus will remain 
in the beam. This parallel beam of atoms passes through a nonuni- 
form magnetic field. As was indicated on p. 287, in such a field 
the force acting on a body which has a magnetic moment M is 


dH 
f=M.- ’ 


where = is the field gradient in the direction perpendicular to the 
atomic beam and M, is the projection of the magnetic moment on 
the direction line of the gradient. If the magnetic moment is perpen- 
dicular to the field, no force acts on the body, but if the moment is 
directed along the field, the body is attracted to either the north or 
south pole, depending on the direction in which M is pointed. If 
the atomic beam contains atoms having different magnetic mo- 
ments, or differently oriented magnetic moments, the beam of 
atoms will spread out: atoms travel in different directions and 
different forces act on them. The beam is allowed to impinge on 
a plate until enough atoms hit the plate to be perceptible. 

The above experiment has been of great importance in the devel- 
opment of basic atomic theory. It is still of importance as a method 
of determining the magnetic moments of atomic nuclei. 
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190. Electron Spin 


By means of experiments with atomic beams, one can measure Jf 
ind, therefore, L. Bne of the important deductions of quantum 
mechanics pertains to the quantisation of angular moment L: the 
-otal momentum Z can assume only a discrete set of values, namely, 


L=Vllene, 


where / is the azimuthal quantum number. Therefore, the magnetic 
moments of atoms also can assume only a discrete sel of values: 


J ———— ——— 
M= Tag VU) =pVIF DD). 
The coefficient in this formula, 


= hh 30 (997 x 10-°° erg/gauss, 


~~ 4mume 


is called the Bohr magneton”. 

In experiments with atomic beams, the external magnetic field is 
. directed perpendicular to the atomic beam. The projections of LZ on 
this perpendicular line can also have only a discrete set of values: 


h 
Pate 
hence 
M,=mn, 


where m is the magnetic quantum number. It is seen that the values 
of AY, must be equal to a whole number of Bohr magnetons. 

M, may be determined directly from experiments with atomic 
beams. What should be the result of experiments with beams of 
atoms? We may expect the following picture. Hydrogen, helium, 
lithium and beryllium have only s electrons. Since Z = O for these 
atoms, such a beam does not split up in a magnetic field. When p 
electrons are present, the beam may be expected to split up into 
three distinct components: an undeviated central component for 
m = 0, and two components arranged symmetrically to the right 
and left for m = + 4. For d electrons, we should obtain five beam 
components corresponding to the five possible values of the quantum 
number m, etc. 

These predictions are realised to the following extent: in certain 
cases a beam of atoms does not split up, while in others it splits up 
into distinct components. Thus, it is evident that some atoms have 


* Niels Bohr (1885-1962) noted Danish scientist who developed the first 
theory of atomic structure. 


190. Electron Spin : 497 


no magnetic moment; if an atom has a magnetic moment, it is 
quantised. 

As regards the results obtained for specific atoms, they point to 
a completely new fact, namely, that an electron has an internal 
magnetic moment. : 

Such is the conclusion to be drawn from an experiment with 
a beam of hydrogen atoms: a beam of hydrogen atoms splits up into 
lwo symmetrical components, which correspond to the deflections of 
atoms with magnetic moments +. There is no undeviated central 
component! This fact may be explained by the following hypothesis, 
which is supported by other data: an electron has a magnetic moment 
and there are only two possible orientations of this moment in 
space, namely, the projections +p on the direction line of the 
external field. 7 

The magnetic moment of an electron due to its motion about a 
nucleus is uniquely related, as we have just seen, to the angular 
momentum of the electron motion about the nucleus. It also tran- 
spires that the internal magnetic moment of an electron is related 
to ils internal angular momentum, or spin. 

As far back as 1925, before the above experiments which show 
that an electron has an internal magnetic moment were performed, 
Goudsmit and Uhlenbeck suggested thal an electron has a spin. 
These physicists showed that such a hypothesis—the existence of 
an electron spin, i.e., an internal angular momentum—removes 
insurmountable difficulties in deciphering spectra. At first, it was 
proposed that spin is a consequence of the rotation of an electron 
about its own axis (whence the origin of the term “spin”). But this 
interpretation is incorrect. Electron spin is a primary characteristic 
and is not reducible to something simpler. 

What is the relationship between the internal angular momentum 
(spin) and the internal magnetic moment of an electron? The exper- 
iment with a beam of hydrogen atoms leads to the conclusion that 
M,, the projection of the internal magnetic moment of an electron, 
may assume only two values, namely, +p. It may be assumed that 
L,, the projection of the spin, may also assume only two values. 

If the formula 


L=Viten 


is applied to the internal angular momentum of an electron, one 
finds that the number Z has a single value. Thus, as quantum mechan- 
ics indicates, the values 1 = 0, 1, 2,... correspond to 1, 3,5, ..., 
and in general (2J + 1) states, respectively. To obtain two spin 
states, which the experimental results show to be the case (27 + 1 = 


= 2), one must assume that l= >. 


32-1409 
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The absolute magnitude of an electron’s internal angular momen- 
tum (spin) has the single possible value L = Vee . As for the 
projection of spin, assuming as before that the differences in the 


possible values of ZL, must be multiples of os we see thal it can 


1 i { } . 
assume only two values, namely, + > x and — re . Thus, (Lz)sp = 
J ] 


h . s 
=6x = , where 6 is a new quantum number (spin number) that 


can assume only two values, namely, +. 
It was stated earlier that the hydrogen experiment led to deflec- 
tions corresponding to a magnetic moment of one magneton, and 


that M, =. Since the quantum number 6 is equal lo , the relation 


Ma 7 


2mc 


turns out ‘to be incorrect for the internal motion of an electron. 
Agreement with experimental results is obtained if 


e 
M sp == Lop. 


Thus, a , the ratio of magnetic moment to angular momentum for 


eleclron motion about a nucleus, is one half of the analo 
for internal motion of an electron. 

The existence of electron spin enables us to formulate the Pauli 
exclusion principle more clearly. There can be no more than two 
electrons in a state having quantum numbers n, J ‘and m. Such 
electrons differ only with respect to spin projection. Experiments 
show that the spin projections of two such electrons cannot be the 
same. The Pauli exclusion principle may now be formulated as 
follows: There can be only one electron in a state characterised. by 

- four quantum numbers n, J, m and 6. In other words, if there are 


two electrons in an n, 1, m state, their spin directions are opposite 
to each other. 


gous ratio 


191. Magnetic Moments of Atoms 


The electron spin hypothesis makes it possible to interpret the 
results of experiments with atomic beams. The measurement of 
magnetic moment is one of the most important methods of deter- 
mining the electron state of atoms. Let us consider the first few 
elements of the Mendeleyev periodic table. 
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We have already studied the hydrogen atom. What happens to 
a beam of helium atoms? Such a beam does not split up. This is as 
it should be. This atom has two electrons in the 2s state. The Pauli 
exclusion principle requires that their spins, and hence their mag- 
netic moments, be oppositely directed; the total magnetic moment 
is equal to zero. 

In a lithium atom, the magnetic moment must be determined by 
the third electron since the actions of the spins of the first two elec- 
irons in the 1s state annul each other. The third lithium electron 
is in the 2s state. Therefore, like in the case of hydrogen, the mag- 
netic moment can be determined only 
by the spin. The splitting up of a lithi- Boron 
um beam into two components shows $$ 
that this analysis is correct. Like in m=) 


the case of hydrogen, one component — / 
corresponds to a spin projection +-, , 

ahds 4 f 

and the other to a spin projection — Es ar — 15 

A beryllium atom has four elec- ~x—-4 


trons—two in the 4s state and two in 
the 2s state. Since the spins of the elec- 
trons in each pair are oppositely direct- 
ed, they annul each other and yield 
a total moment of zero. Therefore, a 
beam of beryllium atoms does not — 

lit up into components. S 
eee a beam of boron atoms Besar? 
splits up into four components. 
How can this be explained? It is evident that the magnetic moment 
is produced only by the fifth electron, which is in the 2p state. 
This state may be realised with three values of the magnetic quan- 
tum number, namely, +1, 0 and —1. Thus, the orbital magnetic 
moment may have three values, including zero. The values of the 
spin magnetic moment must be added to those of the orbital mag- 
netic moment. How is this done? Fig 219 shows all the’ possible 
mutual orientations of the moments. It is seen that four combinations 


7 a ‘ble: 3. 1 ond —2 fin 2 
of angular momentum are possible: >, >, —-7z an — (in om 


units, usually not indicated) . 


The splitting up of a beam of carbon atoms is even more complex: 
seven lines, including one which is not deflected, with the following 
values of angular momentum: 3, 2, 1, 0, —1, —2 and —3. 


32* 
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192. The Mendeleyev Periodic Law 


The regularities of the Mendeleyev periodic system become un- 
derstandable when viewed in the light of available data on the 
electron structure of atoms. 

The basic rules of distribution of electrons in an atom according 
to quantum numbers are derived from the Pauli exclusion principle. 
But if we were governed by this information alone, only the electrons 
of the first eighteen atoms (through argon) would he assigned correct 
quantum numbers. This may be explained as follows. Electrons 
fill the energy states of an atom in consecutive order. In a hydrogen 
atom, the energy levels are degenerate, the energy being determined 
by only one quantum number. In mullti-electron atoms, degeneracy 
is removed as a result of interaction and the energy of an electron 
in an atom depends on all the quantum numbers. It goes without 
saying that in a hydrogen atom an electron at a level for which 
n = 3 has less energy than an electron at a level for which n = 4. 
This is not necessarily so in multi-electron atoms when the azi- 
muthal quantum number is large. Thus, in a number of cases, the 
“natural” consecutive order of quantum numbers does not correspond 
to the order in which the energy states of an atom are filled. 

It is customary to group the electrons of an atom into shells, 
whereby electrons having the same principal quantum number are 
said to belong to the same shell. The shells are usually designated 
by the following letters: 


Key Diy ME IN eceng 


which correspond to n = 1, 2, 3, 4, ..., respectively. 

The electron cloud of an atom is described in the main by the 
distribution of electrons according to quantum numbers or shells. 
This distribution is represented by formulas which indicate by a 
superscript the number of electrons having the same n and 1. For 
example: 


silicon — 1s? 2s? 2p° 3s? 3p?; 
calcium — 14s? 2s? 2p% 3s? 3p% 4s?. 


To determine to what extent a shell is complete, one should 
remember that the maximum number of electrons in the subshells 
s, p, d,f, ... is 2, 6, 10, 14, ..., respectively. These values are 
obtained from the formula 2 (21 + 1). 

Returning to the Mendeleyev table, let us see where the order of 
distribution of electrons according to quantum numbers is violated. 
The first such violation occurs for potassium. The last electron is in 
the 4s level rather than in the 3d level. Calcium, the next element 
in the table, receives another 4s electron. Then, beginning with 
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scandium, the 21st element, the 3d level is built up. But when we 
reach chromium, the 24th element in the table, a new anomaly 
arises. The order of distribution of quantum levels according to 
energies has changed. It becomes unfavourable from the energy 
viewpoint to have two electrons in the 4s level. The configuration 
of chromium is therefore 3s? 3p® 3d° 4s. 

We shall not discuss the remaining anomalies. The electron con- 
figurations for all elements may be found in any physics or chemistry 
handbook. The main point is the following: the distributions of 
electrons according to quantum numbers, which are explained by 
purely physical methods of investigation (spectral analysis and the 
Measurement of magnetic moments), aid us in understanding the 
chemical properties of the various elements. 


193. Ionisation Potentials 


One of the methods used to study the electron configuration of 
an atom is to measure its ionisation energy, i.e., the energy that 
must be expended in removing an electron from the atom. Since 
the energy of an electron in an atom is negative and is reckoned 
from zero (the energy of an electron removed from an atom), the 
ionisation energy is simply equal to the energy level occupied by 
the electron in the atom. It is customary to refer this energy to the 
electron charge and express it in volts. For example, we say that 
the ionisation potential of a hydrogen atom is equal to 13.53 volts. 
This means that to free an electron one must perform work equal 
to that of moving an electron through a potential difference of 
13.53 volts. Fig..214 shows the significance of this value. ? 

In the case of a multi-electron atom, one may find a series of 
ionisation potentials which characterise the levels of the first, 
second, third, etc., electrons, calculating from the position of the 
least bound electron. In this sense, one speaks of the first, second, 
etc., ionisation potential of a given atom. 

There exist many methods of measuring ionisation potentials. 
For this purpose, gases or vapours are placed in an electric field. 
A stream of electrons emitted by a heated filament ionises the gas. 
As long as the energy of a primary electron is insufficient to dislodge 
an atomic electron, the electric current passing through the gas 
does not change. When the energy of the primary electrons becomes 
sufficiently great to dislodge electrons from the atomic gas, a con- 
siderable number of positively charged ions will be present in the 
region and a sharp increase in the electric current occurs. By grad- 
ually increasing the voltage applied to the apparatus, one can 
determine very accurately the instant when this increase in electric 
current begins. This critical value of voltage gives the magnitude 
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of the ionisation potential. The values of the first ionisation poten- 
tials of most chemical elements are graphically illustrated in 
Fig. 220. 

It is easily seen that the periodicity of this property completely 
conforms with that of the periodic table. It is most difficult to 
dislodge an electron from a helium atom and atoms of the other 
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noble gases. This is precisely the reason for their chemical inertness. 
The univalent alkali metals have the lowest potentials. This, too, 
is completely understandable to the chemist, who is familiar with 
the exceptional ability of such substances to enter into reactions. 

The values of the first and successive ionisation potentials are 
related to the valency of the atoms. Atoms of the alkali elements 
are univalent because one electron, which is on the outer ring of 
the atoms of these substances, is more weakly bound than the rest 
of the electrons. The first potentials of a caesium atom, for example, 
have the following values: 3.9, 27, 46 and 62 volts. It is seen that 
the differences between the energies necessary to dislodge the first 
and succeeding electrons are quite large. 


194. Atomic Spectra in the Optical Region 


Atomic absorption spectra as well as emission spectra may be 
obtained, but only the latter are of basic importance. Atomic spectra 
of emission in the optical region may be obtained by spectroscopic 
investigation of the radiation produced by gases and the vapours 
of bodies which are solid at normal temperatures. 
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In order for atoms to radiate, they must be excited, i.e., made to 
pass from a lower energy level to a higher one. When atoms return 
to lower energy levels, an emission spectrum is produced. For every 
transition, there is a corresponding line in the spectrum. 

Atoms may be excited by various means. One method consists 
in the use of a gas discharge. The voltage applied to a gas-discharge 
tube accelerates the charged particles in the gas. These particles 
collide with neutral atoms, to which energy is transmitted by impact. 
Another method, which is used in the spectral analysis of metals, 
consists in the creation of arcs or sparks between two electrodes 
made of the material under investigation. Very high temperatures 
are produced in an arc or spark, resulting in the vaporisation of the 
substance in the region of the discharge. The atoms are excited 
as the result of collisions. 

An atomic emission spectrum consists of a very large number of 
sharp lines. The radiation frequency corresponding to a given line 
satisfies the equation hv,,, = E,, —.E,. Thus, by measuring the 
frequencies of radiated light, we can determine the differences in 
the energy levels of a given atom. One can reliably interpret atomic 
spectra, i.e., determine energy level patterns, from the values of 
radiation frequencies. Handbooks provide data on the spectral lines 
and energy levels of the chemical elements. 

It should not be supposed that a spectrum contains lines corre- 
sponding to all transitions from any one level to any other. Experi- 
ments have confirmed, and a theoretical basis has been provided 
for, the fact that certain selection rules exist. Certain transitions 
are forbidden, i.e., they do not exist. 

One cannot predict, of course, to which lower energy state an 
excited atom will pass, and what will be the frequency of the radiat- 
ed spectral line. But not all transitions occur with equal proba- 
bility. In principle, the probability of transition from one level 
to another may be theoretically calculated. The magnitude of this 
probability determines, in the main, the intensity of the correspond- 
ing spectral line. 

Atomic spectra are affected by external fields. If the substance 
under investigation is located in an electric or magnetic field, a 
number of its spectral lines split up into several components. The 
energy of a system having a magnetic moment M and located in an 
external magnetic field H is given by the expression JfH (see p. 275). 
States which have the same quantum numbers 7 and J may differ 
from each other with respect to the projection of the magnetic 
moment on the direction line of the magnetic field. Therefore, the 
application of a magnetic field removes the degeneracy of energy 
levels and atomic electrons having different magnetic quantum 
numbers will have different energies. 
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Investigations of atomic spectra of emission in the optical region 
are of great practical importance. Such a method of spectral anal- 
ysis constitutes a very sensitive means of determining the chemical 
composition (up to 10-!° gm) of substances, primarily alloys, and 
in a number of cases is more sensitive than chemical analysis. 

Optical frequencies usually arise with relatively weak excitation 
of an atom, i.e., when outer, valence electrons are transferred to 
a higher level. But even a very “high” electron can produce a broad 
spectrum. It would appear that the radiation frequency has no lower 
limit. Thus, the energy level diagram shows that as n increases the 
levels come closer together (Fig. 214 shows the levels and transi- 
tions for hydrogen, but in principle the patterns are the same for 
other atoms). This means that transitions corresponding to very 
low frequencies (long wavelengths) occur. However, experiments 
show that spectra produced by outer electrons, even though they 
extend into the infrared region, do not include lines of very long 
wavelength. It must be concluded that the probability of a transi- 
tion to some energy level such as the 21st is not large, and the proba- 
bility of a transition from the 24st to the 20th, for which a photon 
of low v would be radiated, is quite negligible. 

In the direction of high frequencies (short wavelengths), the fre- 
quency is limited by the ionisation potential. With respect to the 
“highest” electron, the potential of helium is the greatest and that 
of caesium the lowest, viz., 24v and 4v, respectively. This, corre- 
sponds to radiation, frequencies of 6 x 101° cps (4 = 500 A) and 
10*® cps (A = 3,000 A), respectively. Thus, only a high-level electron 
can bring us into the region of very short ultraviolet wavelengths, 
which, relative to characteristic X-ray radiation, may also be called 
a region of very long wavelengths. 

It is quite understandable that electrons of inner shells can be 
raised to high levels with strong excitation. In such a case, the 
characteristic spectrum includes X-rays. 
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In multi-electron atoms, the ionisation potentials of low levels 
reach high values. The excitation of such atoms may, therefore, 
result in the radiation of X-rays (wavelengths of the order of 0.4-40 A). 
An energy of the order of 104 ev must be imparted to an atom in 
order to produce X-ray radiation. This may be achieved in gas- 
discharge tubes by applying a voltage of tens of thousands of volts. 

One may calculate the value of the temperature at which an atom 
begins to radiate X-rays due to thermal collisions with other atoms. 
If the average kinetic energy per degree of freedom is to be of the 
order of 10* ev, the temperature must be of the order of 108 °K. 


195, Atomic X-Ray Spectra 505 


Such high temperatures are achieved in solar and celestial atomic 
explosions (see p. 582). The X-ray radiation of the Sun may be 
determined by means of instruments placed in artificial Earth 
satellites. 

A practical method of obtaining X-rays is by the bombardment 
of a solid (the anti-cathode of an X-ray tube) with a stream of elec- 
trons. Electrons impinging on the anti-cathode are abruptly braked. 
As a result, a continuous spectrum of X-rays is obtained. The elec- 
tron energy, which has been increased to a value ¢; by acceleration 
in an electric field, decreases to a value @2 as the result of braking. 
The energy difference 6, — G6: =v is released in the form of 
radiation. G: may assume any value from 6, to zero. Hence the 
radiation frequencies lie between the limits v= ot and zero. The 
electron energy which does not go into radiation is transformed into 
heat. Only about one-hundredth of the energy of the electron beam 


is transformed into X-ray energy. Evidently, the continuous X-ray 
c he 


spectrum has a short-wavelength limit Anin = ener ae Substi- 


luting the values of the constants, we obtain 


12.3 


Amin = vT? 
where A is expressed in Angstroms and V in kilovolts. Beginning at 
a very definite wavelength, the continuous X-ray spectrum increases 
in intensity with increasing wavelength, reaches a maximum several 
score Angstroms from the short-wavelength limit, and then slowly 
decreases in intensity. 

Investigations show that sharp lines, having a characteristic form 
for every element, are superimposed on the continuous spectrum. 
A characteristic X-ray spectrum arises owing to the fact that some of 
the electrons which impinge on the anti-cathode penetrate the atoms 
and dislodge inner electrons, i.e., electrons in the K, JL, etc., shells. 
An X-ray quantum is produced when a high-level electron passes . 
over to a vacated low-level position. The set of spectral lines due to 
electron transitions to the K level is called the K series, to the ZL 
level the Z series, etc. If the voltage applied to an X-ray tube is 
increased, the series will appear in consecutive order because, as 
the energy of the electrons impinging on the anti-cathode is increased, 
more and more low-energy levels will be consecutively vacated and 
made available for transitions. The K series will be the last to appear. 

The general scheme of electron X-ray transitions is shown in 
Fig. 221, where heavy dots indicate initial levels. The most intense 
lines are marked on the diagram. However, some transitions are 
missing since they are forbidden by the selection rules. This pertains, 
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for example, to transitions with the same value of azimuthal quan- 
tum number. 

Since the configuration of completed lower shells is the same for 
all atoms, X-ray spectra diagrams of different atoms are very similar 
to each other. All spectra contain typical sequences of lines, which 
are systematically shifted along the wavelength scale in accordance 
with the atomic number of the elements. For example, all elements 
produce a strong a doublet (Ka, and Ka-) and a weaker B doublet. 
Quite often these doublets are unresolved. In such a case, one speaks 
of the a line and f line of a given element’s K series. These doublets 
are of a “spin” nature. 

Fig. 222 shows that a characteristic spectrum is gradually shifted 
into the short-wavelength region with increasing atomic number of 
the element giving rise to this spectrum. This law was discovered 
by Moseley. Its physical basis is the steady increase of the inter- 
action force between an electron and a nucleus with increasing 
nuclear charge. The formula expressing this law will not be present- 
ed, but the systematic displacement of the lines is clearly evident 
from the figure. 


CHAPTER XXIX 
MOLECULES 


196. Chemical Bonds 


A molecule is a stable configuration of atoms. Every atom in a 
molecule occupies a stable position. The displacement of an atom in 
any direction results in an increase in the potential energy of the 
molecule. When an atom approaches a neighbouring atom there is 


U 


Fig. 223 


a force of repulsion, and when it recedes a force 
of attraction. Every atom of a molecule, and the 
molecule as a whole, is in a potential well. 

The form of the potential curve of an atom 
or molecule is quite evident (see Fig. 223). Since 
it is not possible to reduce the distance between 
atoms to zero, the curve of potential energy as 
a function of their separation rises sharply as 
this distance decreases. In’ the direction of 
increasing separation, the curve rises from the 
equilibrium position, i.e., the bottom of the 
well, much less sharply. Variations are possible: 
The potential energy at great distances may be 
more or less than at the bottom of the well and 
the well may have or may not have a clearly 
defined wall. The energy of a molecule may be 
more or less than the sum of the energies of its 
atoms. Accordingly, when atoms are combined in 
a molecule, heat is either released or absorbed 
(see p. 568). 


An atom in a potential well is bound to its neighbours. What is 
the reason for this bond? Do various types of bonds exist? Ionic 
and homopolar bonds are two ideal classifications of chemical bonds. 
In the overwhelming majority of cases of interest in chemistry, one 
of these two types of bonds, or an intermediate case in which both 
ideal types coexist, occurs. 

If an atom can transfer one or several electrons to another atom 
electrostatic attraction will occur between the ions which are formed. 
This is what is meant by an ionic bond. At a certain interatomic 
separation which is characteristic of this pair of ions, the forces of 
electrostatic attraction are counterbalanced by the repulsion of the 
electron clouds of the atoms. 
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If an atom is to transfer an electron to another atom, it is neces- 
sary that this process be advantageous from the energy viewpoint. 
In such a case, the simple tendency to pass over to the lowest energy 
level will result in the transfer of an electron. 

It was shown on p. 488 that an electron can be torn away from 
a neutral atom by an expenditure of energy equal to the product of 
an electron charge and the ionisation potential of the alom. Thus, 
the formation of a positive ion is always associated with an expendi- 
ture of work. On the other hand, the formation of a negative ion, 
i.ec., the attachment of an electron to a neutral atom, is associated 
with a release of energy. To be sure, this applies only to the first 
electron. The attachment of a second electron to a singly charged 
negative atomic ion requires an expenditure of work to overcome 
the electrostatic repulsion. 

An ionic bond can exist if the energy of tearing away an electron, 
i.e., the work of creating a positive ion, is less than the sum of the 
energy released in the formation of negative ions and the energy 
due to electrostatic attraction between the ions. 

The alkali metals, in which the last electron is just beginning to 
form a new shell, have the lowest ionisation potentials. In the 
alkaline earth metals, each atom has two loosely bound electrons. 
It- is evident that the formation of a positive ion from a neutral 
atom requires the least work when the electrons to be torn away 
are just beginning to form a new shell. 

On the other hand, it turns out that the most energy is released 
when an electron becomes attached to a halogen atom, in which the 
outer shell is one electron short of being complete. Therefore, in a 
large number of cases, an ionic bond is formed when a transfer of 
electrons resulting in the creation (in the formed ions) of closed 
electron shells, characteristic of atoms of the noble gases, occurs. 
In this way, the physical significance of potential wells in such 
molecules as NaCl and MgCl. may be easily explained. 

However, this explanation is not valid in all cases. For example, 
diatomic molecules of hydrogen, oxygen, etc., are not covered by 
this explanation. It cannot be assumed that in uniting one of these 
atoms is transformed into a negative ion and the other into a positive 
ion. Theoretical arguments need not be mustered. The physical 
properties of molecules formed of ions indicate whether an ionic 
pond exists or not. Specifically, ionic compounds dissociate and 
form electrolytes. A large class of organic molecules do not behave 
in this manner. Therefore, for such substances, an ionic model is 
clearly not applicable. 

How can one explain the bond between atoms of such molecules? 
We must determine whether or not a gain in energy occurs when, 
say, two hydrogen atoms unite to form a molecule. 
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Such a gain does take place, and the conditions for its occurrence 
are given by quantum mechanics. As was stated on p. 479 the electron 
of a hydrogen atom behaves, in the main, like an electron in a po- 
tential well. The zero energy level of an electron in a potential well 
is determined by the dimensions of the well (see p. 480), i.e., the 
smaller its dimensions the greater the zero-point energy. Thus, any 
expansion of the region in which the electron could move results 
in a decrease in energy. 

Now, imagine that two hydrogen atoms, which have one electron 
each, come into contact with each other. Since the Pauli exclusion 
principle allows two electrons to be in one state, the regions in 
which the electrons exist may merge and create a potential well of 
increased dimensions. This can occur only for two electrons having 
opposite spins. 

If a third atom approaches the hydrogen molecule which has 
formed, the argumentation used above is no longer applicable. The 
third electron cannot merge its region of motion with that of the 
electrons in the hydrogen molecule since this is not allowed by the 
Pauli exclusion principle: the vacant sites of the hydrogen molecule 
are occupied by two electrons of opposite spin. 

Thus, the second type of bond, a so-called homopolar bond, is 
provided by a_pair of electrons of opposite spin. In the case of an 
ionic bond there is a transfer of electrons from one atom to another, 
but here the bond is achieved by joint action of the electrons, i.e., 
it is as if a common region of motion has been created. An expansion 
of the region in which an electron may move results in a decrease 
in energy and this is the reason for the formation of a potential 
well. This bond—the merging of the electronic clouds of electrons 
noe opposite spins—is the main type of bond in organic mole- 
cules. 

Each atom is capable of forming a limited number of homopolar 
bonds. Two electrons of opposite spin, having a common “living 
space” in the form of the overlapping clouds of their wave functions, 
take part in the creation of each bond. : 

As we know, s electrons have spherically symmetrical »p-func- 
tions, but the wp-functions of p, d and f electrons extend in specific 
directions. Therefore, a homopolar bond between any two electrons, 
except s electrons, will be a directed bond. If a bond has formed 
between two atoms, the electronic clouds of these atoms assume a 
definite orientation relative to the first bond line. Thus, only cer- 
tain specific angles are formed between bond lines emanating from 
these atoms. The values of such normal bond angles may be derived 
from quantum mechanics for all atoms. 

To a certain extent, both types of bonds are ideal. We frequently’ 
encounter cases in which the physical and chemical properties of 
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a molecule make it necessary to adopt an intermediate bond mecha- 
nism. In an ionic bond an electron is completely transferred from 
one atom to another and in a homopolar bond each electron belongs 
equally to both bound atoms, but in intermediate cases the electrons 
implementing a bond may spend more time near one of the atoms 
than the other. Such a model reflects, for example, the existence 
of an ionic bond in which the bond electrons belong most of the 
time to a negative ion and the existence of a homopolar bond in 
which the bond electrons spend almost the same amount of time 
with each of the bound atoms. Intermediate bonds of any percent- 
age of “ionocity” are possible. 
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A vast amount of data on spacings between the centres of atoms 
in molecules and crystals has been accumulated. Most of this data 
has been obtained by diffraction methods. If we do not insist on 
a very high degree of accuracy, it turns out that it is possible to 
represent molecules by models which give the shape and dimensions 
of the molecules. 

Models of molecules of the NaCl type, in which the atoms are 
joined by an ionic bond, are particularly simple. Each ion may be 
represented by a sphere having a definite radius. The dimensions 
of a number of ions are given in the following table: 


Ton Lit Nat K+ Cs+ r- Cl- 


Br- | I> 


Jon radius, A 0.60 | 0.95 | 1.33 | 1.69 | 1.36 | 1.81 | 1.95 | 2.16 


By means of such a table, we can determine the spacing between 
the centres of ions in, any salt. For example, in NaCl it is equal to 
0.95 + 41.81 = 2.76 A. 

But what is the significance of the assertion that an ion may be 
represented by a sphere? To show that such a representation is justi- 
fied, we must determine how closely to a molecule (say NaCl) another 
ion (sodium or chlorine) may approach. This is possible since exper- 
iments indicate that both fused and solid salts consist of ions. It 
turns out that a second and a third ion approach a given ion just 
as closely as the first. Moreover, ions the charges of which have the 
same sign may also approach each other to a distance equal to the 
sum of the ion radii. Thus, ions behave like spheres. 

An important conclusion regarding ionic molecules may be drawn 
from these geometric facts. Let us assume that a group of molecules 
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are gathered closely around one of the molecules. The arrangement 
of ions is shown in Fig. 224. Since complete equalily exists in inter- 
atomic spacings, itis no longer possible to say with which chlo- 
rine neighbour of a given sodium ion, or with which sodium neigh- 
bour of a given chlorine ion, a molecule is formed. The concept 
of a molecule has lost its meaning. 

We must conclude from the geometric arrangement of atom centres 
that in a concentrated state, i.e.; in a liquid or solid, where the 
atoms are linked by ionic bonds, molecules do not exist as distinct 
formations. The concept of a molecule turns out to be inapplicable. 

But what ‘is the situation in the case of a gas? Upon vaporisation, 
a pair of ions of opposite charge, the net electric charge of which 
is equal to zero, is most easily torn 
away from a liquid. Therefore, basi- 
cally, molecules of the NaCl type are 
to be found in vapours. However, in 
addition to molecules, ions are also 
vaporised from a liquid. 

The situation is entirely different in 
the case of molecules having homo- 
polar, i.e., covalent, bonds. An anal- 
ysis of interatomic spacings encoun- 
tered in molecules shows that spacings 
between atom centres may be calcu- 
lated by means of so-called atomic 
radii. The values of such radii in Angstroms for the most often 
encountered atoms are as follows: 


CcC— C= C= H-— -O— O= 
0.771 0.665 0.602 0.30 0.66 0.55 


O 
0. 


Sill 


Atomic radii decrease with increasing multiplicity of the valent 
bond. The table shows that the separation between two bound carbon 
atoms, C—G, is 1.54 A; the separation in C—H is 1.07 A, etc. 

In constructing a model of a molecule, we also have at our dispos- 
al certain elementary data on bond angles. The existence of normal 
bond angles is understandable from considerations of symmetry and 
is in agreement wilh certain qualitative quantum mechanical reason- 
ing discussed in the preceding article. Thus, the normal bond 
angle of a carbon atom that is linked to four atoms is a tetrahedral 
angle (109°28’). In the case of an aromatic carbon atom, as well as 
other carbon atoms that are linked to three atoms, the normal bond 
angle is equal to 120°. Finally, the characteristic bond angle of 
a carbon atom that is linked to two atoms is 180°. 

The normal bond angles of oxygen, sulphur and nitrogen atoms 
which are linked to two atoms in the case of oxygen and sulphur 
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and to three aloms in the case of nitrogen are equal to 90°. The nitro- 
gen atom in the nitro group NO, has a normal bond angle of 120°. 

In a number of cases, bond angles deviate considerably from the 
“normal”. In certain cyclic compounds of the cyclobutane type, the 
angles are equal to 90° rather than 109°28’. Such deviations are 
due to spatial obstacles. However, before discussing this, we must 
clarify a third geometric characteristic of a molecule, namely, its 
intermolecular radius. 

Investigations of molecular arrangements in crystals have shown 
that each atom may be assigned an intermolecular radius, such that 


Fig. 225 


on the average neighbouring molecules will touch each other, Thus, 
for example, the intermolecular radine of hydrogen is 1.17 A, oxy- 
gen—1.36 A, nitrogen—1.57 A, etc. This does not mean, however, 
that the distances between atoms ok the same molecule which are 
not linked by valent bonds are deterssiaed by these values. The 
dimensions and the form of a molecule are determined by inter- 
action between the forces establishing equilibrium distances between 
atoms which are not linked by valent bonds and the forces establish- 
ing normal bond angles. Since the bond forces between atoms are 
an order of magnitude greater than the other forces, the interatomic 
distances do not change and the configuration of the molecule is 
determined by competition between the elasticity of a bond angle 
and the compressibility of the intermolecular sphere of an atom. 
Here is a simple, but graphic example. Experiments show that 
the bond angle in a molecule of water is equal to 105°. The distance 
between hydrogen atoms is 1.54 A. Therefore, considerable compres- 
sion of the intermolecular spheres of the hydrogen atoms occurs. 
This compression (2 X 1.47—1.54 = 0.8 A) is balanced by the 
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elasticity of the bond angle, the normal value of which is equal 
to 90°. Thus, the forces which compress the hydrogen atoms by 0.8 A 
are equal to the forces which change the angle from 90° to 105°. 
Such a simple mechanism explains the difference between the struc- 
ture of a hydrogen sulphide molecule and that of a water molecule. 
Since the length of the hydrogen-sulphur bond is considerably 
longer than that of the hydrogen-oxygen bond, the hydrogen atoms 
of the former molecule are considerably less “crowded”. It turns out 
that in hydrogen sulphide the distance between hydrogen atoms is 
equal to 1.99 A and the bond angle is equal to 92°. The compression 
of hydrogen atoms by 0.35 A is balanced by a change of only 2° 
in the bond angle. Many organic molecules may be used to illustrate 
the validity of this mechanism. 

Fig. 225 shows the structure of a chlorobenzene molecule. To the 
left is a even atom; in the centre, the beginning of the build up, 


viz.,a C group; and to the right, a model of the molecule. 
c” Nc 


198. The Electronic Cloud of a Molecule 


Electron motion in a molecule, just as in an atom, is described 
by a wave function. Strictly speaking, the w-function is a function 
of 3n coordinates, where n is the number of electrons in the molecule. 
Then, >? will give the probability of any electron distribution, 
i.e., the “electron density”. 

It has been noted already that the solution of the Schrédinger 
equation for multi-electron atoms is very complex. In the case of 
molecules, the difficulties are, of course, even greater. Only approx- 
imate, semiempirical methods of calculation are applicable here. 
In this connection, physical methods of determining electron den- 
sity are of particular importance. However, even when such methods 
are used, the results are quite limited. 

The time-averaged electron density of a molecule is determined 
by means of X-ray structural analysis (the electron density gives 
the probability that the electrons are at a given location). As a 
result of the vibrations of atoms inside a molecule, and of the mole- 
cule as a whole, a photograph of an electronic cloud is smeared. 
Fig. 167 (p. 388) shows a cross-section of the electron density pattern 
of an anthracene molecule. The coarseness of the method may be 
gauged from the fact that the hydrogen atoms of the molecule are 
not apparent in all cases. The method used in plotting the pattern 
is similar to that used in the construction of topographical maps. 
Electron peaks and valleys are indicated by lines connecting points 
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with the same electron density. Each atom is represented by an 
electron density “hill”. Superposition of the bell-shaped density 
functions of two atoms along a bond line results in the formation 
of a “bridge” between the atoms. Unfortunately, the accuracy of the 
method is too poor to enable us to determine the nature of the chemi- 
cal bond by measuring the height of this bridge. Its height is indis- 
tinguishable from the sum of the density functions of two free 
atoms. But the specific nature of the chemical bond should probably 
be manifested in an additional increase in electron density (as com- 
pared with free electrons). Such electronic cloud patterns are, there- 
fore, merely interesting illustrations of molecular structure. 

If the electron density with respect to the atomic nuclei of a mole- 
cule were known, we would be able to calculate the dipole moment 
of the molecule. For this purpose, it would be necessary to determine 
the centres of “gravity” of the positive and negative charges. The 
dipole moment has not yet been determined in this manner, although 
comparison of neutronographic (neutrons scattered on nuclei) and 
roentgenographic data could be used to solve such a problem. How- 
ever, the dipole moment of a molecule can be reliably measured 
(see p. 656) and it is then possible to solve the converse problem, 
namely, determine the centre of “gravity” of negative charge by 
means of the dipole moment. 

It would seem that in purely ionic molecules we encounter the 
extreme case in which the centre of gravity of an electronic cloud 
coincides with the centre of an anion. The dipole moment of KCl, 
for example, could then be predicted in the following manner. If one 
electron is taken from a potassium atom and transferred to a chlo- 
rine atom, one “extra” positive charge will be separated from one 
“extra” negative charge by the distance between the potassium and 
chlorine centres, i.e., 1.81 + 1.33 = 3.14 A. Hence, the dipole 
moment will be equal to 3.14 x 4.8 x 10° = 15 Gaussian units. 
But experiments yield a value of 6.8 Gaussian units. This means 
that even in the case of such a classical ionic bond the potassium 
electron does not go over completely to the anion. On the other 
hand, the other extreme case is fully realised. Evidently, symmetri- 
cal molecules such as Hz, O2 and benzene cannot have a dipole 
moment: the centres of gravity of the electronic cloud and the nuclei 
coincide. 

One other property of an electronic cloud should be mentioned, 
namely, its ability to be displaced relative to a nucleus. Electronic 
clouds may be displaced relative to nuclei by means of an electric 
field. Since nuclei are much heavier than electrons, it may be as- 
sumed that the nuclei remain fixed. The displacement of the electronic 
cloud of a molecule may be described by the displacement of its 
centre of gravity. When the centre of gravity of negative charge is 
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displaced relative to the centre of gravity of positive charge by 
a distance zx, the molecule acquires an induced dipole moment 
p = Nez, where N is the number of electrons in the molecule. The 
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induced dipole moment in- 
creases linearly wilh the field, 
i.e., p = BE. It is customary 
to describe the displacement of 
the centre of gravily of an 
electronic cloud by f, the mag- 
nitude of the polarisability of 
the molecule. The quantity B 
has the dimensions of volume. 
The greater the volume of the 
molecule, the greater the value 
of B (see Chapter XXXV). 


199. Energy Levels of 
Molecules 


The energy of an atom 
changes only by one means: a 
change occurs in its electron 
motion, i.e., an electron passes 
into another quantum state. 
The energy of a molecule 
may also change in this man- 
ner, but by other means as 
well. For example, the atoms 
of a molecule vibrate relative 
to one another. The vibrational 
energy is an integral part of 
the energy of a molecule and 
also may assume only a dis- 
crete set of values. Furthermore, 
a molecule rotates as a 
whole. The rotational energy is 
also quantised and a change in 
the state of a molecule may 
result in achange in rotational 
energy. Therefore, the energy 


state of a molecule is described by indicating the state of its elec- 
tronic cloud (electron level), the state of its vibrational motion 
(vibrational level) and the state of its rotational motion (rotational 
level). We deal with three kinds of information — analogous, so to 
speak, to a house number, floor number and apartment number. 
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But which is analogous to a floor number and which to an apart- 
ment number? Which energy levels are separated by large intervals 
and which by small ones? The answers to these questions are con- 
tained in the energy level diagram shown in Fig. 226, which is 
based on experimental results and theory. Two electronic levels, 
e’ and e”, are shown in this figure. Associated with each electronic 
level is a group of vibrational levels designated by a set of v values, 
and associated with each vibrational level is a group of rotational 
levels designated by a set of j values. 

Clearly, the intervals between rotational levels are less than be- 
tween vibrational levels, and those between vibrational levels are 
less than between electronic levels. 

Let us assume that a molecule may have electronic levels at 100, 
200, 300, ... energy units, vibrational levels at 10, 20, 30, ... 
units, and rotational levels at-1, 2, 3, ... units. In such a case, 
a molecule at the second electronic level, first vibrational level 
and third rotational level will have an energy of 213 units. 

Thus, the energy of a molecule may be given in the form 


W = Wert Woin + Wrot. . 
The frequency of radiated or absorbed light may always be deter- 
-mined from the difference in energy between two levels, i.e., 


v= (AWer-+ AW sio-+ AWrot)- 


It would be interesting to examine transitions involving a change 
in only one “kind” of energy. Practically, this is possible only for 
rotational transitions and it is easily seen why this is so. 

Let us investigate the absorption of electromagnetic waves by 
a group of molecules. Beginning with the longest wavelengths, i.e., 
smallest packets of energy hv, we find that the molecules do not 
absorb energy as long as the magnitude of a quantum of energy is 
less than the difference between two neighbouring levels. By grad- 
ually increasing the frequency, we eventually obtain quanta of 
energy that are capable of raising molecules from one “rotational” 
level to another. Experiments show that this occurs in the micro- 
wave region (at the end of the radio band) or, in other words, in the 
far infrared spectrum. It is found that wavelengths of the order of 
Q.1-1 mm are absorbed by molecules. Thus, a pure rotational spec- 
trum may be obtained. 

By further increasing the frequency, we enable the rotational 
spectrum to become more developed, but nothing new occurs until 
the quanta of energy impinging on the substance are of sufficiently 
high frequency to make molecules pass from one vibrational level to 
another. It is clear, however, that a pure vibrational spectrum, i.€-; 
a series of transitions for which the number of the rotational level 
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does not change, is never obtained. Transitions from one vibrational 
level to another involve various rotational levels. For example, 
a transition from the zero (lowest) vibrational level to the first may 
be accomplished by molecules from the fourth rotational level to 
the third, the third to the second, etc. Thus, there arises a vibration- 
rotational spectrum, which may be observed in infrared light (3- 
50 w). Clearly, all transitions from one vibrational level to another 
are close to one another and yield a group of very close lines in the 
spectrum. For low resolution, these lines merge into one band. 
Each band corresponds to a definite vibrational transition. 

By increasing the frequency still further, we finally reach a new 
spectral region, which is characteristic of a molecule. This occurs 
in the optical and ultraviolet portion of the spectrum where the 
energy of a quantum suffices for the transition of a molecule from 
one electronic level to another. Here, of course, neither pure electron- 
ic transitions nor pure electronic-vibrational transitions are possi- 
ble. Electronic-rotational transitions, involving a change in “house”, 
“floor” and “apartment”, occur. Since a vibration-rotational transi- 
tion gives rise to a band, the spectrum in the optical region is 
“striped”, i.e., it consists of a system of bands. 


F Now, let us discuss the various types of molecular spectra in 
etail. 


200. The Rotational Spectrum of Molecules 


Free rotation of molecules occurs only in the gas state. Therefore, 
basic data on rotational energy levels are obtained by studying gas 
spectra. Investigation of these spectra by optical means is very dif- 
ficult. Much more suitable for this purpose is a radio-spectroscopic 
procedure that has been developed during recent years. A generator 
of electromagnetic waves transmits radiation through a waveguide* 
which is partially filled with the gas under investigation. After pass- 
ing through the gas, the electromagnetic waves arrive at a receiver 
which measures their intensity. This measurement may be performed 
over a large range of frequencies. The width of the band of frequen- 
cies generated by radio methods may be made so narrow that the 
resolving power becomes hundreds of thousands of times (!) greater 
than in the case of optical methods. Optical methods enable us to 
distinguish lines separated by 0.1 cm~?, but by radio methods we can 
distinguish lines separated by 10-° cm-1**. By means of this high 


* A waveguide is a metallic duct of rectangular or circular cross-section 
through which centimetre radio waves may be propagated with practically 
no losses. : 

** In spectroscopy, in addition to wavelength units, it is customary to 
use a reciprocal wavelength unit (wave number), i.e., the number of waves 
per centimetre, 
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resolving power, we are able to solve a number of interesting prob- 
lems which are discussed below. A rotational spectrum arises as 
a result of the quantisation of a molecule’s kinetic energy of rotation: 
2 
Krot al = ’ 

where J is the moment of inertia of the molecule. This is the form of 
the expression for the energy of a diatomic molecule. This energy is 
described by a single moment of inertia taken about an axis perpen- 
dicular to the line joining the atoms and which passes through the 
centre of inertia. As was indicated earlier (p. 77), in the general case 
the rotation is described by three moments of inertia taken about 
three main axes. 

Briefly, let us consider the rotational spectra of diatomic molecules. 

First, it should be emphasised that not all molecules, including 
diatomic molecules, will yield a rotational spectrum of radiation or 
absorption. As has been explained already (see p. 322), every radia- 
tor or absorber of electromagnetic waves is a kind of oscillator, i.e., 
an elementary dipole. If the atomic motion of a molecule or the mo- 
tion of a molecule as a whole is not accompanied by a change in 
dipole moment, such motion cannot result in the radiation or absorp- 
tion of electromagnetic waves. 

When a molecule radiates or absorbs energy, its dipole moment p 
varies periodically as the oscillation frequency. The dipole moment 
oscillates about an average value, corresponding to the equilibrium 
position of the atoms. It may be shown that the intensity of the spec- 
tral lines is proportional to the derivative (=) Laks i.e., the maxi- 
mum rate of change of the dipole moment with respect to interatom- 
ic spacing. All symmetrical molecules the atoms of which are 
joined by homopolar! bonds have a constant zero value of p. There- 
fore, they do not give rise to rotational spectra. Such molecules in- 
clude, for example, all diatomic molecules of the same atoms 
(Hz, O2, Na, etc.). 

Let us consider the rotational spectrum of a diatomic polar mole- 
cule, i.e., a molecule possessing a dipole moment. The rotational 


2 
energy of such a molecule is Kyo; = “ here w is the angular ve- 
locity of rotation and J the molecule’s moment of inertia: 


myme 5 
m+ Mg 


T=myri+mr?= ’ 
where 7; and r. are the distances to the centre of inertia andr = ry + 
+ rg. The value of w is determined from the fact that according to 
a rule of quantum mechanics (p. 490) the rotational momentum, 
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Iw, may assume only the discrete set of values 
h ya aN 


where j = 0, 1, 2, ... is the quantum number designating the rota-. 
tional levels. Therefore, the angular velocities of rotation of a mole- 
cule may assume only the following set of values: 


I aoa 
oj=55VIG+1); 
hence, 


I@? he i, 
Kyo = = a + 4)I- 


Beginning with a zero energy of rotation, the energy of successive 
levels increases in accordance with a square law. 
Energy transitions are subject to a simple selection rule, i.e., 
only transitions between neighbouring levels are allowed (Fig. 227). 
The radiation or absorption frequency 
in the rotational spectrum of a diatomic 
molecule is given by 


hj : 
Y= G=9, 43 2, acer) 


for a transition between the j and j — 4 
levels. In this simple case, the rotational 
spectrum consists of'a system of equally 
spaced lines. 

For different gas temperatures, the 


Fig. 227 average energy of rotation of a molecule 
differs. In accordance with Boltzmann’s 
law, the most probable energy is given by = kT (two rota- 


tional degrees of freedom—see p. 195). Thus, the number of 
the energy level at which a molecule is most frequently located may 
be easily calculated. For-example, in the case of a molecule of a 
hydrochloric acid vapour (J = 2.61 x 10-*° gm cm?), at tempera- 
tures of 300°, 600° and 1,200°K, we obtain j = 4, 6 and 8, respec- 
tively. 

Since transitions are possible only between neighbouring levels, 
a series of equally spaced frequencies will be grouped about the line 
of “average” j-value. The line intensity decreases as its distance from 
this j-value increases, since the number of molecules in the corre- 
sponding energy state decreases. 

Rotational spectra enable us to determine interatomic distances 
in simple molecules to a very high degree of accuracy (much greater 
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accuracy than by diffraction methods). Thus, if the number of atoms 
in a molecule is not large, the distances between atoms may be deter- 
mined if the moment of inertia and the masses of the atoms are 
known. For a diatomic molecule, 


I mym 
r= Ve: where m= —1 , 
m my me 


In the case of a hydrochloric acid molecule: 
my = 1.67x10-24 gm; mc, = 35X1.67X 10-24 gm. 


The separation between the H and Cl atoms in an HCl molecule is 
V 2.61 X 10-40 X36 X 1.67 x 10-24 
aa 35x1.67x 10748 


This value agrees closely with values obtained by other means. 


= 1.631078 cm. 
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This type of spectrum may be observed in a wavelength band 
extending from 2-3 to several score microns. For brevity, the vibra- 
tion-rotational absorption spectrum is referred to as the “infrared 
spectrum”. In the case of solids, where there is no molecular rotation, 
a pure vibrational spectrum is obtained. In the case of liquids, where 
rotation is impeded, the rotational structure of the band is smeared. 
- Diatomie Molecules. Let us disregard rotation for the present 
and consider vibrational energy levels. 

The vibration of a diatomic molecule may be visualised by means 
of a simple model consisting of two spheres joined by a spring. In 
such a system, the natural frequency of oscillations is given by 


k 
v= 


— ? 


2m m 


where & is tlre stiffness coefficient determining the binding force and 
m is the mass of an atom when the atoms in the molecule are the same; 
when the masses differ, m is the reduced mass, which is equal to 
MyMNo 
m4-+Mo 
shows that the energy of an oscillator is given by the formula 


Eé= (v +5) hy. 


Here, = hy is the zero-point energy of the oscillator, i.e., the oscilla- 


(we leave the proof of this to the reader). Quantum mechanics 


tion energy at absolute zero, and v = 0, 1, 2, ... is the oscillation 
quantum number. Moreover, it is shown in quantum mechanics 
that in the case of harmonic oscillators energy transitions may 
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occur only between neighbouring levels. In the case of nonharmonic 
oscillators transitions skipping one level or more occur, but these 
are weaker than the main transitions. Harmonic oscillations occur 
under the action of a réstoring force—kz. The potential energy of 
such oscillations is et, i.e., the shape of the curve is parabolic. 

Fig. 228 shows a potential curve (and an inscribed parabola) for 
a diatomic molecule. The horizontal lines represent energy levels 
based on theoretical calculations. 
For low values of energy, the devi- 
ation of the potential curve from 
a parabola is negligible. Such 
a molecule may be expected to 
obey the harmonic oscillator law 
as long as the vibrational energy 
is much less than the dissociation 
energy of the molecule. Under such 
conditions, the vibrational levels 
may be considered ta be equally 
spaced, and since only transitions 
between neighbouring levels are 
allowed, the diatomic molecule 
will possess a single transition frequency. If there is no molecular ro- 
tation, the entire spectrum will consist of a single line. Actually, in 
addition to the main frequency v, the spectrum contains the “over- 
tone” frequencies 2v, 3v, etc. (as the separation between levels 
decreases, the proportional trend of the overtone frequencies is 
lost). However, the overtones are weak and in very many cases we 
have a right to speak of a single vibration frequency. 

The presence of molecular rotation will transform such a spectral 
line into a band. If a molecule vibrates and rotates simultaneously, 
its energy is determined by the two quantum numbers v and j: 


i : 
e= (v+5) hve + gap i G+). 


The frequencies obtained now fall into two groups, one less than 
and one more than the vibration frequency v,;,. These groups are 
known as branches and are designated by the letters R and P. 
Taking into account the selection rules discussed above, we obtain 
the following frequency formula: 


| ae 
V = Voy ok Zar) Fal, 2s 


The plus sign corresponds to transitions to higher rotational levels 
and the minus sign to lower rotational levels. 
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This is shown in Fig. 229, which illustrates the spectral band of 
HCl. The point O corresponds to vyipy, and the vertical lines to the 
right and to the left indicate the obtained frequencies. The height of 


a line is proportional to 
the intensity at the given 
frequency. When the res- 
olution is high, each line 
appears distinct. On the 
other hand, when the res- 
olution is low, the lines 
merge into a band the in- 
tensity dependence of 
which is given by the 
envelope of the spectral 
lines. In Fig. 230, we see 
a diagram of the energy 
transitions which produce 
this band. It should be 
noted that a pure vibra- 
tional transition (from 
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j’ = 0 to j” = 0) is forbidden and as a result there is a gap in 
the middle of the band. There is an absorption maximum to the 
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right as well asto the left of the vibra- 
tion frequency. For the reason discussed 
in the preceding article, the absorption 
maxima occur for the j-values which 
are most frequently encountered at 
the given temperature. Therefore, as 
the temperature increases, the shape 
of the spectral band changes as shown 
in the diagram. 

Vibrations of a Polyatomic Molecule. 
A polyatomic molecule may execute a 
large number of vibrational motions. 
This number is equal to the number 
of vibrational degrees of freedom of the 
molecule and may be calculated as fol- 
lows. Amolecule consisting of NV atoms 
has 3N degrees of freedom. Three of 
them are associated with the coordi- 
nates of the molecule’s centre of mass. 
In the general case, the number of rota- 
tional degrees of freedom is also equal to 


three. But linear molecule. have only two rotational degrees of 
freedom since rotation about a line passing through the centres of the 
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atoms is physically meaningless. This, the number of vibrational 
degrees of freedom and, hence, the number of vibration frequencies 
is equal to 3N — 6 or 3N — 5. If the dipole moment of the molecule 
does not change for a given vibration, the corresponding frequency 
will not be manifested. (We shall return to the problem of so- 
called inactive vibrations later.) Be that as it may, the number of: 
vibration frequencies and, hence, the number of bands in the infra- 
red spectrum, is strictly determined by the number of atoms in the 
molecule and by its symmetry. 

In the absence of molecular rotation, i.e., in the case of solids, an 
infrared absorption spectrum consists of lines which correspond 
to vibrational transitions. Since, perforce, thick layers are used in 
such an investigation, there is considerable absorption under normal 
conditions and the lines merge into a band. In liquids, molecular 
rotation is retarded and the rotational structure of such a band will 
be smeared, i.e., individual lines can no longer be detected. 

Now, let us consider the physical meaning of vibrations in a poly- 
atomic molecule. Actually, what kind of vibrations occur? In the 
case of a diatomic molecule the situation was clear, i.e., we were 
dealing with vibrations along a bond line. What quantities vibrate 
harmonically in polyatomic molecules? 

For any molecular vibration, the deviations of atoms from their 
equilibrium positions may be described by displacements along 
a bond and by the distortion of bond angles. The instantaneous con- 
figuration of a vibrating molecule may be completely described by 
(3N — 6) q; coordinates (using the word coordinate in its broad 
sense). For an arbitrary choice of the q; coordinates, their values will 
not obey a simple vibration law. The law of change with respect to 
time of each g; can be represented by a complex, albeit periodic, 
curve. However, it turns out that it is possible to describe a vibrat- 
ing molecule by (3N — 6)Q; numbers which vary harmonically 
with frequencies v;. These Q; “coordinates” are called normal coor- 
dinates and the frequencies v; are called normal vibration fre- 
quencies. 

The fact that it is possible to introduce normal coordinates means 
that the periodic curves of change of any q; coordinates may be 
resolved into spectra of normal vibration frequencies. We can 
always assume that a vibration spectrum consists of normal vibra- 
tion frequencies. 

What is the nature of Q; coordinates? Are they obtained only for 
a particular choice of coordinate system? The answer to the latter 
question is no. First and foremost, normal coordinates are linear 
combinations of gq; displacements. Therefore, a normal coordinate 
describes the vibration of a molecule as a whole. Examples of nor- 
mal vibrations are illustrated in Figs. 231 and 232 for CO. and H2O 
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molecules. The actual vibration of a molecule is the resultant of the 


indicated motions. 
The normal vibration frequencies of a molecule can be determined 


from its spectrum. These can then be used to obtain a clear picture 


of the molecular vibrations. 0 
The characteristic nature of many 
vibration frequencies is of great prac- we a 
. . on 
tical importance. Careful study has H sw wo & 
@) H 
0 C 0 
om ba E tf 
\ c a 
mS NX 
2 
oN . ag 
————— p : 
Na. 
= 
~~ 
oe Pee 
gn = 
c) a. es 
a 
\ ar? 
vi) ®) (A 
Fig. 231 Fig. 232 


shown that basically in certain normal vibrations only one in- 
teratomic spacing or one bond angle varies. If a molecule preserves 
that bond, such a frequency varies little in a group of related com- 
pounds. This fact is utilised in chemistry. 

The vibration frequencies of a molecule are measured not only by 
means of infrared absorption spectra but by means of Raman spectra 
as well. As will be seen below, these two methods effectively sup- 


plement each other. 


202. Raman Scattering of Light 


Raman scattering refers to the particular case of the scattering of 
light of frequency v by a substance when, in addition to the strong 
scattering of light of constant frequency v, there appears a series 
of lines of lower and higher frequencies. , 

Usually, observations are made at right angles to the incident 
light. A mercury lamp provides the required radiation. The spectrum 
of this radiation contains several intense lines, the most important 
of which is a blue line corresponding to a wavelength of 4,358 A. 
By means of a spectrograph, one can obtain a photograph of the 
scattered radiation spectrum. Such a photograph is shown in Fig. 233. 
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The main characteristic of such a spectrum is the following. About 
each excited line there appear identical groups of considerably 
weaker lines. These satellites are usually spaced symmetrically to 
the right and to the left, but they may differ in intensity. This phe- 
nomenon was discovered independently by Raman in India and 
Landsberg and Mandelshtam in the Soviet Union. Raman's work, 
however, was published first*. Hence, such spectra are called Raman 
scattering spectra. 

The spectral pattern can be explained as follows. Basically, a pho- 
ton hv is scattered by a molecule elastically, i.e., the frequency 

remains constant. However, in addi- 

we ee EE pea cero ~~ «tion to such scattering, it is also pos- 
e At FAL A fb i sible to have scattering with some loss 
ae i eens of energy; such energy may be expend- 
opie: aa ae ed in the transition of a molecule 

Se ee Rae from one level to another. Let us as- 
L cas —~ =: sume that a photon hv has lost an 

amount of energy equal to that 

Fig. 233 required to raise a molecule from the 

zero vibrational level to the first level. 

The energy loss is 6:=1 — Gr=0= Avon. Therefore, the scattered 

photon has an energy h (v — vp;y). An associated line or “satellite” 
appears in the spectrum on the side of lower frequencies. 

The lower frequencies, v — vzin, are called red satellites and the 
higher frequencies violet satellites. Scattering with a frequency 
greater than v occurs when a photon hits an excited molecule. In 
such a case, the photon is scattered, but it simultaneously gains the 
“extra” energy due to the transition of the molecule to a lower level. 
If the excited molecule was at the first vibrational level, the photon 
increases its energy by hvyip and the frequency v ++ vyjp appears in 
the spectrum. 

This scattering mechanism excellently explains the difference 
between the intensities of the red and violet lines. At room tempera- 


ture, most molecules are at the zero level with an energy + hvvin- 
A smaller number of molecules are at the first excited level with an 


3 aos ‘ . 
energy = voip. Therefore, it is clear that the intensity of the violet 


lines must be less. Moreover, at low temperatures the violet lines. 
practically disappear. The ratio of the intensities of the violet lines 
to the red lines is proportional to the ratio of the number of atoms. 
in the first state to the number in the zero state. According to the 


* Raman sent a telegram about his discovery to the British journal Nature- 


202. Raman Scattering of Light 527 


Boltzmann law, 


3 Voip 
N ~2 nT RYyib 
ost _ & Zap UT 
Ny=o A i AVoib 

2 kT 


This formula agrees excellently with experimental results. 

Thus, the displaced lines of a Raman spectrum are shifted by 
amounts equal (in energy units) to the difference between the energy 
levels of the given molecule. 

We have discussed the case of two vibrational levels, but it is clear 
that the discussion is valid for all energy transitions—pure rotation- 
al, vibration-rotational, etc. The satellite lines closest to the main 
scattering line correspond to the lowest energy transitions. The 
rotational spectrum is located considerably closer to the main line 
than the vibration-rotational spectrum. 

In the case of absorption spectra, the selection rules for the oscilla- 
tion quantum number are the same as those for Raman scattering, 
but the selection rules for the rotational quantum number are differ- 
ent. Transitions are allowed for which 


Av=+1 and Aj=0, +2. 


Thus, the vibration-rotational band consists of a pure vibrational 
line displaced from the excited line by vp;, and a series of lines dis- 
placed from the excited line by voip — 2vro¢ and Voin + 2Vrot- 

Raman spectra are usually obtained by scattering from liquids. 
The lines vy:p ++ 2v,o: appear smeared, but the lines of the pure 
vibration spectrum are distinct. 

Raman spectra have an important advantage over infrared spec- 
tra. The measurements are transferred, so to speak, to the visible 
region. The frequencies which were measured directly in the infrared 
spectrum are determined as the difference between the main line and 
the Raman line with approximately the same accuracy. 

It would seem that one could dispense with the infrared spectrum. 
However, this is not always the case. In certain respects, the infrared 
and Raman spectra supplement each other. 

What is the difference between the process of wave radiation from 
a molecule and the process of scattering from a molecule? In both 
cases a molecule sends wavelets into space, i.e., in both cases a mol- 
ecule behaves during radiation like a dipole. However, in the first 
case a molecule behaves like a dipole in the absence of an external. 
field, while in the second it behaves like a dipole when acted upon 
by the field of an incident wave. Thus, radiation or absorption will 
occur when changes in the state of a molecule (vibration, rotation, 
etc.) are accompanied by changes in the induced dipole moment, 
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i.e., in the polarisability B. Theoretical calculations indicate that 
such a change should occur when the configuration of the molecule 
passes through equilibrium. 
Lines will occur in an infrared spectrum for vibrations which sat- 
isfy the condition 
(22), 0 
r=" 


dr 


Raman lines will occur for vibrations which satisfy the condition 
dB 
(+) eau 


Quite often these conditions exclude each other. Therefore a certain 
vibration may be active in the infrared spectrum but inactive in 
the Raman. spectrum, and vice versa. 

A CO, molecule may serve as an example. One of such a molecule’s 
three vibrations—the linearly symmetrical vibration—leaves the 
dipole moment unaltered and equal to zero. That vibration is inac- 
tive in the infrared spectrum. In the Raman spectrum, on the other 
hand, only that vibration will be active; the other two will be absent. 
In the case of an anti-symmetrical vibration, one may reason as 
follows: In both extreme positions, the deformation of the electron 
cloud, and hence the polarisability, is the same. During vibration 
the polarisability changes in the same manner in both half-periods, 
and at the equilibrium position passes through a minimum or max- 
imum, but this does not mean that 


We shall not discuss these regularities any further. They have 
been studied in detail and the results are available in tabular form. 
Such tables enable us to determine from the symmetry of a molecule 
the number of vibration frequencies in-its infrared and Raman spec- 
tra. The converse is also true, namely, the symmetry of a molecule 
can be determined from the number of lines in its spectra. 


203. Electron Spectrum 


Let us consider absorption spectra in which the transitions are in 
the visible and ultraviolet regions. The magnitude of a photon of 
the incident light will be of the same order of magnitude as the 
difference between the electronic levels of a molecule. Electron 
transitions become possible. However, as has been indicated already, 
electron transitions are accompanied by changes in vibrational and 
rotational energy. Therefore, a very broad band is associated with 
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every such transition. Moreover, under normal experimental condi- 
tions this band is continuous, i.e., its “vibration-rotational” struc- 
ture is not discernible. Each electron transition band contains nu- 
merous narrow vibration-rolational transition bands, whereby all] 
changes of the oscillation quantum numbers are possible. 

The properties of a molecule in an excited electronic state differ 
from its properties when it is at a zero electronic level. When 
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a molecule becomes excited, the system of vibrational and rotational 
levels and hence the vibration frequencies, i.e., the differences 
belween the vibrational levels, change. Also, the shape of the poten- 
tial curve and the equilibrium spacings between atoms change. 

Absorption curves in the visible and ultraviolet regions are suf- 
ficiently characteristic to be used in identifying substances. 

The dependence of the absorption of light on the thickness of 
a layer of substance may be expressed as follows (cf. p. 120): 


T=Ipe-"*, 
where J is the intensity of the incident beam, J the intensity of the 


transmitted beam, x the thickness of the layer, and w the absorption 
coefficient for light. The value of the absorption coefficient depends on 


the wavelength of the incident light. A curve of + as a function 


0 
of wavelength is sometimes called an absorption curve. Usually, 
however, this term refers to a curve of p as a function of A or v. 


34—1409 
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The relation for the absorption of light in solutions may be writlen 
in the form 


T= LoeahNt or [= Ioe—e¢, 


where J is the distance traversed by the beam in the substance and 
kN and ec are expressions for the absorption coefficient of the solu- 
tion. It is quite reasonable to assume that the absorption cocl- 
ficient is proportional to the concentration of the substance, which 
may be expressed as the number of molecules NV per unit volume or 
the number of moles of substance c per litre of solution. In the case 
of solutions, the term “absorption curve” usually refers Lo a curve 
showing the dependence of the coefficient A or € on A. 

Examples of absorption curves in the visible and ultraviolet spec- 
trum are shown in Fig. 234. Curve J is for Congo red, 2—for aniline, 
$—for phenol and 4—for benzene. 


CHAPTER XXX 
ATOMIC NUCLEI 
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Investigation of the structure of atomic nuclei is inseparably 
linked with the study of spontaneous and induced decay of atomic 
nuclei and nuclear particles. By studying the fragments of a disin- 
tegrating atomic nucleus and tracing the fate of these particles, we 
can make certain inferences about the structure of the nucleus and 
about the nuclear forces. 

It is not surprising that the spontaneous decay of nuclei, i.e., 
natural radioactivity, was the first to be studied in detail. Concurrent- 
ly, physicists began to study cosmic rays—radiation from outer 
space possessing extraordinary penetrating power. In interacting 
wilh matter, cosmic particles behave like projectiles. For a long 
time, cosmic ray investigations were the primary means of studying 
transformations of elementary particles and to a cerlain extent of 
studying alomic nuclei. At present, streams of particles created in 
accelerators are the primary means of studying the disintegration 
of atomic nuclei. 

The experimental methods to be discussed below are applicable 
to the study of cosmic rays and particles created during the nuclear 
bombardment of one or another target. 

Wilson Cloud Chamber. When a fast particle passes through 
a chamber containing supersaturated vapour and creates ions along 
ils path, a track is left that is very similar to the “tail” occasionally 
seen in the sky after a plane has passed. This track is produced by 
condensation of the vapour. The ions along the path of the particle 
are centres of vapour condensation; as a result the track is easily 
detected. The track of the particle may be photographed or ob- 
served directly. 

The state of the vapour in the chamber may be controlled by 
varying the volume of the chamber. This is achieved by means of 
a piston. Rapid adiabatic expansion brings the vapour to a state of 
supersaturalion. 

If a Wilson cloud chamber is located in a magnetic field, the veloc- 
ily of a particle in the chamber may be determined from the curva- 
ture of its path if e/m is known or, conversely, e/m may be deter- 
mined if the velocity is known (see the formula on p. 444). 
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Jonisation Counters and Jonisation Chambers. An ionisation device 
used in radiation investigations usually consists of a cylindri- 
cal condenser filled with gas. The cylindrical casing constiltules one 
electrode and a thin insulated wire placed along the axis of the cylin- 
der the other (Fig. 235). The magnitude of the vollage applied to the 
condenser and the pressure of the gas in the counter must be chosen 
in a special manner, depending on the nature of the problem. In 
one such widely used device, known as a Geiger counter, a voltage 
equal to the breakdown voltage is applied 
between the cylinder and the wire. When 
an ionising particle enters such a counter 
through the wall or insulator, there passes 
through the condenser a pulse of cur- 
rent which continues to flow until the pri- 
mary electrons and the created electrons 
and ions of the self-maintained discharge 
reach the positive casing of the condenser. 
This pulse of current can be amplified by 
ordinary radio-engineering means and the 
passage of the particle through the coun- 
ter may be determined by a click or light 
flash, or by means of a digital counter. 

A digital counter can be used to deter- 
mine the number of particles entering the 
instrument. This requires that the pulse 

Fig.” 235 of current due to one particle cease by 

the time the next particle enters the 

counter. If the operating conditions of the counter are not chosen 

properly, the counter begins to “choke” and count incorrectly. The 

resolving power of an ionisation counter has an upper limit. This 

limit, however, is quite high, i.e., up to 10,000 particles per second 
may be counted. 

If we decrease the voltage, the pulse of current passing through 
the condenser may be made proportional to the number of created 
ions (proportional counter). For this purpose, it is necessary to oper- 
ate in a region where the gas discharge is not self-maintained. Pri- 
mary electrons’moving in the electric field of the condenser accumu- 
late energy, whereupon ionisation by collision commences and new 
ions and electrons are produced. The first n pairs of ions produced 
by a particle entering the counter are transformed into kn pairs of 
ions. When the operating conditions are such thal the discharge is 
not self-maintained, the amplification factor & is constant. Thus, 
a proportional counter not only establishes that a particle has passed 
through the counter, but also measures the ionising power of the 
particle. 


‘. 
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Just like in the Geiger counter described above, discharge in 
a proportional counter ceases when no more ionisation takes place. 
The distinguishing feature of a Geiger counter is that a particle 
entering such a counter behaves like a trigger mechanism, and the 
breakdown time is independent of the primary ionisation. 

Since a proportional counter is sensitive to the ionising power of 
a particle, the operating conditions of the counter may be chosen 
so that only certain kinds of particles are recorded by the instrument. 

If the operating conditions of the instrument correspond to satura- 
lion current (achieved by decreasing the voltage), the current through 
the counter becomes a measure of the radiation energy absorbed 
in the instrument per unit time. In such a case, the device is galled 
an ionisation chamber. The amplification factor & is then equal to 
unity. The meril of an ionisation chamber is its high stability of 
operation. The design of an ionisation chamber varies considerably 
from case to case. The choice of chamber filling, wall material and 
number and shape of electrodes will depend on the purpose of the 
investigation. Chambers vary in size from about a cubic millimetre 
to several hundred cubic metres. Under the action of a constant 
source of ionisation, currents ranging from 107!’ to 10-* ampere are 
produced in ionisation chambers. 

Scintillation Counters. The method of counting elementary par- 
ticles by the flashes of a fluorescent substance (scintillations) was 
first used by Rutherford in his classical investigations of the struc- 
ture of atomic nuclei. Modern instruments bear little resemblance 
to the simple device used by Rutherford. 

A particle impinging on a phosphor* may produce a flash of light. 
There exist a large number of organic and inorganic substances which 
are capable of transforming the energy of charged particles and 
photons into luminous energy. The duration of afterglow in many 
phosphors is very short—of the order of a thousand millionth of 
a second. This makes it possible to construct scintillation counters 
with large counting rates. The light yield of a number of phosphors 
is proportional to the energy of the incident particles. Such phosphors 
find application in the construction of counters for the determination 
of the energy of particles. 

In modern counters phosphors are combined with photomultipli- 
ers having ordinary pholocathodes that are sensitive to visible light. 
The electric current produced in a photomultiplier is amplified and 
fed to a counting device. 

The most widely used organic phosphors include anthracene, stil- 
bene and terphenyl. These chemical compounds belong to the class 


* Phosphors constitule a large group of solids which, generally speaking, 
do not have anything in common with the chemical element phosphorus. 
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of so-called aromatic compounds, which contain rings of six carbon 
atoms. For use as scintillators, these substances must be obtained in 
the form of monocrystals. Since large crystals are rather difficult lo 
grow and since crystals of organic compounds are very fragile, the 
use of plastic scintillators, i.e., solid solutions of organic phosphors 
in transparent plastics such as polystyrene and other such high- 
polymer materials, is of considerable interest. Of the inorganic phos- 
phors, halides of alkali metals, zinc sulphide and tungstates of alkali 
earth metals find application. 

Cherenkov Counters. As far back as 1934, Cherenkov showed that 
when a fast particle moves in a perfectly pure liquid or solid diclec- 
tric a peculiar luminescence occurs. This luminescence basically differs 
from fluorescence, which is related to energy transitions in atoms 
of the substance, and from bremsstrahlung of the continuous X-ray 
spectrum type. Cherenkov radiation occurs when a charged particle 
moves with a velocity exceeding the phase velocity of the propaga- 
tion of light in the dielectric. The distinguishing feature of this 
radiation is that it is propagated along a conical.surface in the prop- 
agation direction of the particle. The angle of the cone may be deter- 
mined from the formula : 


cos 0 = 7 ; 

where @ is the angle between the surface of the cone and the direc- 
tion of motion of the particle, V is the velocity of the particle, and 
v is the velocity of light in the medium. Thus, for a medium having 
a given index of refraction m, there exists a critical velocity, V = 


c Ps . 7 . “45 . 
a es below which no radiation occurs. At this critical velocity, 


the radiation is parallel to the direction of motion of the particle. 
For a particle moving with a velocity very close to the velocity of 


‘ reer 1. . 
light (V =c), the angle of radiation 0 = arccos = is a maximum. 


In the case of cyclic hexane, n = 1.437 and @ -= 46°. 

Theoretical calculations and experiments show that, in the 
main, the Cherenkov radiation spectrum is located in the visible 
region. 

Cherenkov radiation is a phenomenon similar to the formation of 
a nose wave near a moving ship; in such a case, the velocity of the 
ship is greater than the velocity of the waves on the surface of the 
water. 

Fig. 236 illustrates how Cherenkov radiation is formed. A charged 
particle moves along the axial line and the electromagnetic wave 
following the particle temporarily polarises the medium at points 
of the particle path. All such points become sources of spherical 
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waves. There exists a certain angle for which these spherical waves 
coincide in phase and form a single front. 

Let us consider two points along the path of the charged particle 
(Fig. 237). Two spherical waves have been created—one at the in- 
slant of time f and the other at the instant of time ¢ + t. Clearly, 
t is the time required for the particle to traverse the distance between 
these two points. In order for these two waves to be propagated at 
an angle 0 in the same phase, 
the time of travel of the first 
beam must be greater than the 
time of travel of the second 
by t. During the time t, the 
particle traverses a distance 
Vt and the wavea distance vt. 


Thus, we obtain the formula 
v 


cos 0 = va . 
Cherenkov radiation is wide- 
ly used as a means of regis- Fig. 236 


lering elementary particles. 
Counters based on this phenom- 
enon are known as Cherenkov 
counters. Like scintillation 
counters, they contain a lumi- 
nescent material, photomulti- 
pliers and amplifiers of pho- 
toelectric current. Various 
types of Cherenkov counters Fig. 237 

have been designed. 

Such counters have numerous merits. These include a high count- 
ing rate and the ability to determine the charge of particles moving 
with velocities very close to that of light (the curve of light yield 
as a function of particle charge rises sharply). Only by means of 
Cherenkov counters is it possible to solve such important problems 
as the direct determination of the velocity of a charged particle, the 
determination of the direction in which an extremely fast particle 
moves, elc. 


Arrangement of Counters. To study transformation and interac- 
tion processes of elementary particles, one must be able to detect the 
emergence of a particle at a particular point and trace its subsequent 
course. Various arrangements of standard counting circuits are used 
in solving such problems. For example, two or more counters may be 
electrically connected in such a manner that a count occurs only 
when a discharge begins in all the counters at exactly the same time. 
This may serve to indicate that a certain particle has passed through 
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all the counters. Such an arrangement of counters is known as 
a “coincidence” arrangement. 

Nuclear-Emulsion Method. A gelatinous film containing micro- 
crystals of silver bromide may serve as the photosensilive layer of 
a photoplate. Basically, the photographic process consists in the ion- 
isation of these crystals, resulting in the reduction of the silver bro- 
mide. This process occurs under the action of light or of charged par- 
ticles. A concealed track is formed in the emulsion when a charged 
particle passes through it. This track may be seen after the photo- 
plate is developed. Photoemulsion tracks tell us a great deal about 
the particles producing them. Thus, strongly ionising particles leave 
heavy tracks. Moreover, since the ionisation produced depends on 
the charge and the velocity of the particle, considerable information 
may be obtained by simply examining the appearance of the track. 
The range of a particle in the photoemulsion is also a source of valu- 
able information. By measuring the length of the track, one may 
determine the energy of the particle. 

Ordinary photoplates, having: thin layers of emulsion, are hardly 
suitable for nuclear investigations. Such plates would register only 
those particles the motion of which is strictly in the plane of the 
plate. Misovsky and Zhdanov in the Soviet Union, and several years 
later Powell in Britain, introduced the use of photoplates having an 
emulsion thickness of almost 1 mm (one hundred times greater than 
the thickness in ordinary plates). The complex transformations 
occurring when a particle disintegrates are registered in clear visual 
form in the photographic method. 

Fig. 238 shows a typical photograph obtained by this method. 
Nuclear transformations have occurred at P and S. 

In a recent modification of this method, an emulsion chamber of 
considerable volume is used as the medium for registering the parti- 
cle track. 

Methods of Analysing Observations. Using the described instru- 
ments, an investigator is able to determine the most important con- 
stants of an elementary particle, viz., velocity, energy, electric 
charge and mass. All these parameters may be determined very accu- 
rately. Moreover, when a stream of particles is available, it is possi- 
ble to determine the spin and magnetic moment of an elementary par- 
ticle. For this purpose, a magnetic field is used to divide the beam 
(see p. 495). 

It should be recalled that only charged particles can be observed 
directly. Our knowledge of neutral particles and photons is obtained 
indirectly, i.e., by determining how these invisible particles 
affect charged particles. Nevertheless, our knowledge of invisible 
particles is highly reliable. ; 
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The laws of conservation of momentum and energy have wide 
application in the investigation of elementary particle transforma- 
tions. Since we are dealing with fast particles, changes in mass must 
be taken into account in applying the law of conservation of energy. 

Consider a particle track in which “branching” occurs. This signi- 
fies that the first particle has been transformed into two particles. 
Therefore, the following relations must hold. First, the momentum of 
the first particle must be equal to the vector sum of the momenta 
of the created particles: 


P1=D2+ P3- 
Secondly, 


K,=K,+K3+A6, 


where AG = c?Am (the increment Am is the difference in mass be- 
tween m2 + m3 and m,). 

Nuclear physics experiments show that the laws of conservation 
are strictly obeyed in all elementary particle transformations. Hence, 
these laws may be used to explain the properties of neutral parti- 
cles, which do not leave tracks in a photographic emulsion and do 
not ionise a gas. When an investigator observes two diverging tracks 
on a photoplate, he knows that at the branching point a neutral 
particle transformation has occurred. By determining the momen- 
tum, energy and mass of the created particles, he may reliably 
ascertain the values of the parameters of the neutral particle. This 
is how the neutron was discovered and how neutrinos and neutral 
mesons are studied (see below). 
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Atomic nuclei consist of protons and neutrons. The basic character- 
istics of a proton, like any other elementary particle, are its charge, 
mass, spin and magnetic moment. A proton has a positive elemen- 
tary electric charge, i.e., its charge is equal in magnitude, but 
opposite in sign, to the charge of an electron. Its mass is equal to 
1.6724 x 10-4 gm, which is 1,836 times greater than the mass of 


an electron. It has a spin of + and a magnetic moment of 1.41 X 
x 1075 Gaussian unit. 

A neutron has a somewhat greater mass than a proton, i.e., its 
mass is equal to 1.6737 x 10-*4 gm. It also has a spin of a Its mag- 


netic moment is anti-parallel to the spin and equal to 0.966 x 10-°5 
Gaussian unit. 

A neutron does not have an electric charge and does not leave 
a track in a Wilson cloud chamber or on a photoplate. The properties 
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of a neutron are primarily determined by studying collisions between 
neutrons and various nuclei. Knowing the mass and velocity of 
a nucleus which collides with a neutron, one can determine the 
velocity Urey: and the mass AJpeu,. Thus, in accordance with the laws of 
elastic impact (see p. 67), we obtain 


2M neut 
i; = = _Vneut 
nucl Mneut+Mnuct 8" 


where Anew aNd Upeug are unknown quantities. By studying colli- 
sions between neutrons and various nuclei, one can roughly determine 
Mnyeut if it is assumed that the initial velocity Uneu: is the same in 
different collisions. A precise value for A7,eus may be determined from 
the values of the mass defect of nuclear reactions (see below). 

The spin and magnetic moment of neutrons have been directly 
determined from very interesting measurements on a stream of neu- 
trons passing through magnetised iron. However, we shall not de- 
scribe these measurements. 
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The number of protons Z in the nucleus of a given element deter- 
mines the chemical properties of the element, and the position of 
an atom in the Mendeleyev periodic table is determined precisely 
by the number of its protons Z. 

A chemical element may have several isotopes, which differ from 
One another with respect to the number of neutrons in the nucleus. 
An isolope of a ‘given element may be described by its mass num- 
ber AZ, the total number of protons and neutrons in its nucleus*. 
Thus, the number of neutrons in a nucleus is equal to M — Z. 

Ordinary natural substances are a mixture of isotopes. The isotop- 
ic composition of a natural substance generally remains fixed and 
is, therefore, characteristic of the given chemical element. Frequent- 
ly, one of the isotopes of the mixture predominates. For example, 
hydrogen is encountered in nature in the form of ordinary hydrogen 
H? and deuterium H? = D, whereby the percentage of the former 
is 99.98% and that of the latter 0.02%. The percentage of O1* in 
nalural oxygen is 99.76%. In natural uranium the main isotope 
is U8; its percentage is 99.28%. 


* The nucleus of an isotope of a given chemical element is denoted by the 
symbol of the element. The mass number is indicated by a superscript to the 
right. A subscript to the left is frequently used to indicate the atomic number 
Z of the element, but this is not necessary since the chemical symbol deter- 
mines Z. Thus, the three isotopes of oxygen may be denoted by the symbols 024, 
O17 and O18 or g0'8, gO17 and g018. The nuclei of these isotopes contain 8, 9 
and 10 neutrons (Jf — Z), respectively. 
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Let us represent the mass of the lightest and most prevalent isotope 
of oxygen by My. The magnitude Mo is called an atomic weight 
unit. It is customary to express the atomic weight A of isolopes and 
elements in such relative units. Precise measurements indicate that 
an atomic weight unit is equal to a mass of 1.662 x 10-7" gm. The 
absolute value of the mass of an isotope may be determined from 
the formula. 


M, = 1.662 « 10°! A gm. 


The mass of a proton is 1,836 times greater than the mass of an 
electron. Therefore, the mass of an atom and the mass of its nucleus 
are almost equal. However, in a number of cases, particularly for 
light atoms, this difference may be determined by means of modern 
measuring methods and should be taken into account. It is evident 
that the following relationship exists between the mass of an alom 
MM, and the mass of its nucleus Wy. 


My=M,—Zm. 


In atomic weight units, m = 5.5 x 10-4*. Thus, the difference be- 
tween My and AM , is of the order of several hundredths of a per cent 
of these masses (for heavy atoms—of the order of several thou- 
sandths of a per cent). 7 

The relative atomic weight of an isotope is approximately equal 
to its mass number. For example, the mass of H? is equal to 1.00812, 
of D? to 2.01472, of Ne”® to 19.9981, etc. 

The following important conclusion may be drawn from a careful 
study of a table of isotope masses: the mass of a nucleus is less than 
the sum of the masses of its constituent particles. For example, the 
mass of a neutron is 1.00893 and the mass of a proton is 1.00812; 
hence the sum of the masses of two neutrons and two protons is equal 
to 4.0341; but the mass of a helium atom, consisting of two neutrons 
and two protons, equals 4.0039. Thus, the mass of a helium nucleus 
is 0.0302 of an atomic weight unit less than the sum of the masses 
of the constituent particles of the nucleus. This value is a thousand 
times larger than the accuracy of the measurements. 

This difference between the sum of the masses of the constituent 
particles of the nucleus and the mass of the nucleus is an important 
example of mass defect. Every nucleus has a specific mass defect. 

One of the most important conclusions of the theory of relativity 
is the principle of the equivalence of mass and energy (see p. 424). 
This principle states that if a system acquires or loses a quantity of 
energy A@, the mass of the system increases or decreases, respec- 


* This value is obtained by dividing the mass of an electron in grams by 
1.662 x 10-24 gm. 
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tively, by Am == AG/c?. The mass defect of a nucleus is easily 
explained by means of this principle, i-e., it isa measure of the bind- 
ing energy of the nuclear particles. 

Let us see what this means. In chemistry and physics, binding ener- 
ey is understood to mean the energy required to completely break 
a bond. If a nucleus could be divided into ils constiluent parti- 
cles, the mass of the system would increase by the value of the mass 
defect Am. From the viewpoint of Binstein’s law, this means that 
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Fig. 239 


the nucleus has been given an energy A@ = c?Am, which is equal 
to the binding energy. Hence, a change in mass of one atomic weight 
unit is equivalent to a change in energy of 


1.662 x 107-24 x 9 x 102° erg = 1.496 x 10-3 erg =931.8 Mev 


(1 ev = 1.601 x 107° erg and 1 Mev = 10° ergs). Using these val- 
ues and knowing the values of the mass defect, one can easily calculate 
the binding energy of an atomic nucleus. 

Fig. 239 shows a curve of binding energy per nuclear particle, i.e., 
c?A m/M, as a function of mass number. It is seen that the binding 
energy per nuclear particle first rises rapidly, although not quite 
uniformly, then remains at approximately 8 Mev, and finally drops 
slightly for the last elements in the Mendeleyev periodic table. The 
following conclusion may be drawn from the fact that the energy 
remains constant at 8 Mev over a large portion of the curve: Since 
the binding energy per particle does not depend on the total number 
of particles in a nucleus, interaction in a nucleus must occur only 
between particles that are very close to each other. It follows that 
nuclear forces between particles become effective only when one 
particle approaches another very closely (see below). 

It is instructive to compare the 8 Mev value with the chemical 
binding energies of molecules. The latter are usually equal to sever- 
al electron volts per atom. Thus, the energy required to break up 
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a nucleus is several million times greater than the energy required 
to break up a molecule into atoms. 

Nuclear forces will be discussed in greater detail below. It is 
already clear from the above examples, however, that these forces 
reach tremendous values when a nucleus breaks up. Jt is also clear 
that nuclear forces constitute a new kind of force, since they can 
bind together particles the electric charges of which are of the same 
sign. Nuclear forces are not reducible to electrical forces. 
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Substances in which atomic nuclei are continuously disintegrating 
are said to be radioactive. Two kinds of radioactive disintegration 
exist: a-decay and f-decay. 

Alpha decay consists in the ejection of an a-particle from an atom- 
ic nucleus. Since an a-particle consists of two protons and two neu- 
trons, its symbol is ,He*. Thus, a-decay may be represented as 
follows: 


EI = ,_.Ei"—‘ + He!, 


where El] is an arbitrary chemical element. This disintegration reac- 
tion, as well as other nuclear reactions to be considered on p. 567, 
obeys the law of conservation of charge (the sum of the subscripts in 
the right member of the equation equals Z) and the law ‘of conserva- 
tion of mass number (the sum of the superscripts in the right mem- 
ber of the equation equals J/). 

Beta decay consists in the ejection of an ordinary electron (B~) or 
a positron (6*) from a nucleus. It should be recalled that the masses 
as well as the magnitudes of the charges of these two particles are 
equal. The ejection of such a light electrically charged particle 
from a nucleus results in the nuclear transformation of a proton into 
a neutron or a neutron into a proton. Such a transformation ensures 
the conservation of electric charge upon disintegration. 

Beta decay may be represented as follows: 
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Thus, for B--decay, a neutron of the nucleus is transformed into a pro- 
ton (the number of protons increases); for $*-decay, the reverse 
transformation takes place. : 7), 

In addition, y-rays, i.e., electromagnetic radiation of shorter 
wavelength than X-rays, may be emitted in both types of radio- 
active disintegration. Alpha radioactivity occurs only for the heavy 
elements (beginning with bismuth); B~-radioactivily is encoun- 
tered considerably more often than f*-radioactivity. 
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A radioactive substance found in nature is said to be naturally 
radioactive; a radioactive substance obtained by means of nuclear 
reactions is said to be artificially radioactive. 

If another clement is formed when the nucleus of a radioactive 
element decays, and if a third element is formed from the second, 
ele., the sequence of such elements is called a radioactive series. Four 
radioactive series, beginning with U*8, Th?*, U?8> and U*88*, are 
known. 

Radioactive decay obeys the law 


N = Noe7*!, 


where Ny is the number of nuclei present at the initial instant ¢ = 0, 
N the number of nuclei remaining (not disintegrated) after the elapse 
of a time t, and 4 a radioactive-decay constant, which is a constant 
for a given element. 

It is easily seen that the time interval 7 during which half of 
the number of atoms present decay is given by 


2 _ 0.693 


The time interval 7 is known as the half-life period or simply 
half-life of the radioactive element. 

The predecessors of the naturally radioactive series have a half- 
life lying in the range of 108-10! years. On the other hand, the half- 
life of intermediate decay products and artificially radioactive ele- 
ments may be equal to an extremely small fraction of a second. 

A quantily of radioactive matter could be expressed, of course, in 
grams. However, it is easier and more convenient to describe a quan- 
tity of radioactive matter by its activity, i.e., the numberof decay 
events per second. The curie is a historical unit of measurement 
which is equal to 3.7 x 10!° decays per second. In laboratory work, 
this unit is found to be inconveniently large; hence, the millicurie, 
one-thousandth of a curie, is often used. Another unit used is the 
rutherford, which is equal to 10° decays per second. Thus, 1 milli- 
curie = 37 rutherfords. 

If the half-life 7 of a substance is known, its initial radioactivily 
can be easily determined. The fraction of substance decaying in 
1 second is equal to 


or, since A is a small number we obtain, 


N _ _ 0.693 
Noe r 


* U233 has predecessors among the transuranic elements. 
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In the case of m grams of substance of atomic weight A, the activily 
is equal to 
0.693 — m decays 
T “AK1.66 * 10724 see * 

It is generally assumed that the radioactive products found during 100 days 
of operation of a nuclear reactor (see below) amount to [| curie per watt. For 
500,000 kW, the quantity of decay products is equal to about 5v0 gin, i.e., 
10~° gm per watt. Taking the average atomic weight of the fission products as 


100, we find by means of the above formula that the average half-life of the 
radioactive products is equal to 105 seconds, i.e., about 24 hours. 


We cannot describe fully the properties of a radioactive substance 

by means of its half-life and activity alone. We must indicate, in 

addition, whether the substance is an 

a-partlicle or #-particle radiator and 

whether the decay is accompanied by 

y-radiation. An even fuller descrip- 

tion requires that we specify the energy 

, of the particles ejected from the nuclei 

and the energy of the radiation. The 

properties of g-particles radiated from 

one radioactive material differ very 

r little from the properties of a-particles 

radiated from another radioactive 

material. The inilial velocities of such 

a-parlicles lie in the range of 15,000- 

20,000 km/sec and the number of pairs 

of ions formed in air by such an a- 

Fig. 240 particle lies inthe range of 1 10°-2 x 

x 10°. The energies of f-particles eject- 

ed during decay are distributed con- 

tinuously from zero to several hundred or thousand kev. The ener- 

gies of y-rays emitted by one radioactive substance differ from the 

energies of y-rays emitted by another radioactive substance, but 
their order of magnitude remains the same for all elements. 

Now, let us direct our attention to the theory of a- and p-decay. 

In alpha decay, an c-particle tunnels through a potential barrier 

and is then subjected lo electrostatic repulsion. The potential curve 

of a nucleus is illustrated in Fig. 240. We see a potential well and 

a potential barrier (beyond the barrier the electrostatic potential 

energy drops hyperbolically). It has been shown in the case of a radio- 

active element by scattering a-particles from nuclei of this ele- 

ment that the height of the potential barrier is at least 9 Mev. Never- 

theless, particles having an energy of only 4 Mev escape from nuclei 

by tunneling through the barrier. 


U 
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Such a model explains why one radioactive element has a very 
short half-life and another has a very long half-life. If the difference 
between the energy of an a-particle in a nucleus and the height of 
the potential barrier changes even slightly, the probability of 
a-particle leakage through the barrier changes radically (see the 
formula on p. 485). 

Now, let us consider B-decay. The existence of the neutrino, a par- 
licle which eluded detection for a long time, was first hypothesised 
in explaining this process. 

Two arguments immediately present themselves in support of the 
existence of the neutrino. First, such a particle is required to satisfy 
the law of conservation of angular momentum and, secondly, it is 
required to satisfy the law of conservation of energy. We know that 
neutrons, protons, electrons and positrons have a spin of 3/,. As has 
been indicated, B-decay in an atomic nucleus transforms a proton 
into a neutron, or vice versa. Since the number of nucleons in a nu- 
cleus remains unchanged, f-decay cannot transform an even spin into 
an odd one. But this is precisely what would be required if only an 
electron, having a spin of 1/,, were ejected during B-decay. This con- 
tradiction was resolved by hypothesising the existence of the neutri- 
no, a particle having a spin of */5. In addition, by means of the neu- 
trino, we can explain why the f-particle spectrum is continuous. 

If B-decay consisted in the ejection of only an electron, this elec- 
tron would have a well-defined energy since the initial and final 
energy levels, i.e., the energies of the primary nucleus and the new 
nucleus, are well defined. But as has been indicated, a continuous 
spectrum of electrons, from a maximum velocity down to zero, is 
obtained. Such a spectrum can be explained by assuming that dur- 
ing disintegration two particles are ejected from a nucleus in accord- 


ance with the equation n= p+ e-+ v. The energy is divided be- 
tween the electron (or positron) and the neutrino in a random manner. 

The detection of a neutral particle having a negligibly small mass 
(we now know that the mass of a neutrino is less than 0.002 of the 
mass of an electron) is an extremely difficult problem. This problem 
was not solved until 1956. 

We have indicated that a neutrino is formed when a neutron disin- 
tegrates. A particularly large number of such decay events should 
occur in a nuclear reactor, where a tremendous number of nuclear 
fragments, rich in neutrons, are continuously formed. If neutrinos 
exist, a stream of such particles should emerge from a reactor. 
When a neutrino collides with a proton the following reaction should 


occur: v + pt et +n, i.e., a positron and a neutron will be 
formed. Such reactions should be observed in targets containing 
large numbers of hydrogen atoms (i.e., protons) if they are placed 
close to a nuclear reactor. This reaction should occur very rarely 
35—1409 
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(several times per hour) since a neutrino has extremely high penetral- 
ing power. At the same time, a large number of other nuclear react- 
ions occur close to a reactor. 

The difficulties involved in detecting a neutrino are evident. They 
may be overcome by properly utilising the distinguishing features 
of this reaction. We know that the positron is quickly annihilated 
with an electron of one of the target atoms and that such annihila- 
tion yields two photons. The neutron, after covering a certain disLance 
in the target, is absorbed by one of the impurity atoms (cadmium) 
added to the target for this purpose. The average life-time of a neu- 
tron before being absorbed has been calculated. It is equal to ap- 
proximately 5 psec. The absorption of a neutron by cadmium is accom- 
panied by y-radiation. Using modern measuring techniques, the 
experimenter must distinguish the following sequence of events 
from all others: the creation of two Photons, followed in 5 w sec by 
a stronger pulse of y-radiation. Since this has been achieved, the 
existence of the neutrino may be considered to be a fact. 

Alpha and beta decay obey the following formula of decay as 
a function of time: N = Nye. Thus, the decay of a nucleus is an 
independent event that does not affect the behaviour of other nuclei. 
All nuclei have the same decay probability. Assume that half of 
the nuclei disintegrate during an interval of time 7’. Since the remain- 
ing half is subject to the same conditions as the original group of 
atoms, half of the remaining half disintegrate during an equal inter- 
val of time. The fact that the decay of a nucleus is not dependent on 
the behaviour of its neighbours means that in a given time interval 
At the fraction of the number of atoms present which disintegrate, 


i A : 
i.e., =, will always be the same. This statement may he 
expressed as follows: 


By integrating this expression, we obtain the exponential law of 
decay. 

It is useful to remember that the reason why exponential laws are 
encountered so frequently in physics is that they are the mathemati- 
cal expression for decrease in accordance with the widespread rule 


that for equal changes in argument a function decreases by the same 
fraction of its magnitude. 


208. Spin and Magnetic Moment of a Nucleus 


Nucleons, the components of a nucleus, have a spin and hence 
4 magnetic moment. Thus, the presence of spin is not peculiar to an 
electron alone. Elementary particles may have spin, and a visual 
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interpretation of this fact is not only unnecessary, but incorrect. 
We have indicated already that the model of a particle rotating 
about its axis is completely without basis since the spin of a parti- 
cle cannot be interpreted classically. 

The angular momentum of any particle and, hence of an atomic 
nucleus, is given by 


Vs6+) x so , 


and the projection of the spin on the selected direction line may 
assume 2s -+ 1 values in the interval s to — s. Usually, it is not the 
above expression which is called spin, but the number s determining 
this expression. ; ; 

In accordance with the laws of quantum mechanics, the difference 
2s between the largest and smallest values of spin must equal a whole 
number or zero. Therefore, the spin of a particle may equal 

4 , 38 
0, 2D ’ A: = yooee 
: : ; 4 
Neutrons and protons, just like electrons, have a spin of me 


Examining tables of spin values for various atomic nuclei, we 
observe a number of interesting regularities. Thus, nuclei consisting 
of an even number of protons and an even number of neutrons (He, 
C2, 01%) have zero spin. Evidently, the number of nucleons equal 
to a multiple of four generally plays an important role. In many 
cases, but by no means in all, the spin of an atomic nucleus may be 
determined in the following manner: the number closest to MZ which 
yields a whole number when divided by four is subtracted from M. 


and the result is multiplied by =. For example: Li® has a spin equal 


to24= 1, Li7 5, BY — 1 and BUS. 

There are no exceptions to the following, rather obvious, rule: 
the spin of a nucleus for which J is even is equal to a whole number 
or zero, and the spin of a nucleus for which M is odd is equal to an 
odd multiple of 4/2. : or 

The spin of an atomic nucleus may be determined from the hyper- 
fine structure of its optical spectrum. Even though the energy levels 
split to a very small extent, the differences in levels may be meas- 
ured very accurately. Splitting occurs as a result of the fact that 
different mutual orientations of electron spin and nuclear spin 
correspond to different energies. 

Available data on nuclear spins indicate that the Pauli exclusion 
principle is applicable to the protons and neutrons of nuclei. Two 
identical particles can be located at a single energy level only if 
their spins are anti-parallel. Since a proton differs from a neutron 
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two protons and two neutrons may be located at a single level. 
This compact group of four particles, having a total spin of zero. 
will be recognised as the nucleus of a helium atom («-particle). 

If a particle has spin, it also has a magnetic moment. The 
angular momentum L jis directly proportional to the magnetic 
moment M, but the magnetic moment may be either parallel or anti- 
parallel to the spin. : 

If the spin of a particle (simple or complex) is given by s, the 
particle’s magnetic moment may be written in the form 


M = gus, 


: Ki : : 
where p is an elementary magneton equal to lon , m is the mass ol 
4 4 


the particle, and g is a dimensionless factor. This equation is the 
general form of the relation given on p. 498 for the case of an elec- 


sce é 1 ; 
tron. For such a particle, s = and g must be taken equal to 2 in 


order to obtain agreement with available experimental data. 

Different particles (elementary as well as complex) have different 
values of g. For example, the g-factor of a neutron is equal to 3.8206 
and that of a proton is equal to 5.5791. 

The value of an elementary magneton depends on the mass of the 
particle. However, it is customary to use only two magneton values: 
the Bohr magneton for light particles and the nuclear magneton (the 
magneton calculated for the case of a proton) for heavy particles, 


1 
whereby LN = [93g HBonr- The values of the g-factor given above 


are calculated for py. 
A theory of g-factors and magnetic moments which relates these 
properties of a nucleus to its structure does not exist. 


209. Magnetic Resonance 


Assume that a substance containing particles of spin s and magnet- 
ic moment JM is placed in a constant magnetic field of intensity H. 
The potential energy of such a particle is equal to the scalar product 
MN = MH. According to the general law of quantum mechanics, 
this energy may assume only a discrete set of values corresponding 
to 2s + 1 possible orientations of spin and magnetic moment. 

As usual, the resulting system of energy levels is determined by 
the energy transitions. The selection rules allow transitions between 
neighbouring levels only if the difference between their s values is 
equal to one. Assume, for example, that in one state 


M,= gps 
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and in another 
M,=gp(s—1). 
Hlence, the difference in energy is 
' 62—61= gpH. 
The energy levels will be equally spaced. 


The frequency of a radiated or absorbed quantum of energy corre- 
sponding to the calculated difference in levels is 


For an electron 
vy = 2.8 x 10°H 

and for a proton 
v= 3.46 x 10°H. 


It is seen that for each value of A there is a corresponding charac- 
leristic frequency known as the magnetic resonance frequency. For 
the realisable range of field intensities, these frequencies lie in the 
radio band: in the case of nuclei, in tlie short and ultrashort wave- 
length region; in the case of electrons, in the centimetre wavelength 
region. 

Experiments and theoretical calculations show that in practice 
it is not possible to detect the radiation corresponding to these 
frequencies. However, it is possible to observe resonance absorp- 
tion of electromagnetic waves of corresponding wavelength. For 
this purpose, the substance is placed in a coil connected to a high- 
frequency generator and the coil is then placed in a constant magnet- 
ic field. The resonance may be “trapped” by varying the field inten- 
sity while the frequency is kept constant or by varying the frequency 
while H is kept constant. Magnetic resonance is extremely selec- 
live. The width of the absorption peak is of the order of 0.1 mc/s 
at 460 mc/s. 

The magnetic resonance method is widely used in studying vari- 
ous substances. Observation of electronic as well as nuclear resonance 
is of great interest. The presence of electrons having spins which 
are uncompensated for indicates to the chemist that so-called free 
radicals are present and enables him to determine the nature of 
chemical bonds. The chemical composition of a substance may be 
determined by means of nuclear resonance. But the following impor- 
tant fact should be noted. The magnetic resonance effect is so sensi- 
tive as to enable the detection of the superposition of an atom’s 
electronic cloud field on the external field. It turns out that the na- 
ture of this supplementary field depends on the properties of the chemi- 
cal bond between a given atom and the others. Thus, the resonance 
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frequencies of a given atom vary somewhat depending on the chemi- 
cal bond. This phenomenon is known as chemical displacement. 
In Fig. 241, we see an oscillogram of the absorption spectrum of 

a chemical compound. This illustrates magnetic resonance for fluo- 
rine nuclei. We see four peaks, one of which is three times as high as 
the other three. In the molecule whose structural formula is shown 
in the figure, four “different” fluorines are to be seen. There are three 

_ times as many fluorine atoms in the CF, group as each of the other 
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Magnetic field intensity 
Fig. 241 


three ‘chemically different” fluorine atoms in the molecule. Chemi- 
cal displacement has separated the nuclear resonances of the tluo- 
rine atoms and made it possible to determine the structural formula 
of this compound. 

Thus, nuclear resonance provides us with a new means of chemical 
analysis. Instead of obtaining only a gross chemical formula (in our 
example, the total number of fluorine atoms relative to, say, hydro- 
gen atoms), we can now obtain a detailed picture of a chemical for- 
mula, i.e., the proportions of differently bound atoms of a single 


kind. : 
210. Quadrupole Resonance 


The scheme of molecular energy levels discussed in the preceding 
chapter lacks certain detail. It turns out that each rotational level 
has an infrastructure. Between the electronic cloud of a molecule 
and the atomic nuclei there may exist yet another interaction: an 
atomic nucleus may possess an electric quadrupole moment an 
depending on the orientation of the atomic nucleus relative to the 
electronic cloud the molecule may possess different values of energy. 
The magnitudes of the energy associated with this interaction are 
quite small and the corresponding energy levels are grouped around 
the rotational levels. 

Thus, to describe a molecule, one must indicate its quadrupo- 
le energy level in addition to its electronic, vibrational and rota- 


tional levels. 
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Quadrupole interaction does not always exist. If it does, the rota- 
tional transitions discussed above are in fact rotational-quadrupole 
transitions. Pure quadrupole transitions, i.e., transitions between 
separate quadrupole levels, may be observed. Moreover, rotational- 
quadrupole transitions may be resolved. Both problems can be 
solved by radio-spectroscopic methods. Pure quadrupole transitions 
lie in the range of 1-800 mce/s, i.e., in the short-wavelength band. 
Rotational-quadrupole transitions may be detected by studying the 
absorption of microwaves (millimetre waves) in gases. 

Pure quadrupole transitions are of primary interest. They can 
be observed in solids and in certain liquids. 

The following formula gives the energy of interaction between an 
atomic nucleus and the electronic cloud of a molecule for the case of 
an axially symmetrical field (such a field exists in all linear mole- 
cules): eer 

INH —S (Ss 
aed er repre o 
here, Q is the quadrupole moment of the nucleus and gq =< 
is the second derivative of the electric potential along the axis of 
symmetry of the field. This effect does not occur in the case of nuclei 
having a spin of 0 or}/,. Also, no interaction occurs when the elec- 
‘tronic cloud surrounding a nucleus is spherically symmetrical. 

There are a limited number of levels. If the selection rules are 

taken into account, one finds that the number of possible transitions 


‘ ‘ 3 
is quite small. Thus, one line occurs when s= 1 and s = zz two 


, 7 
lines when s= = , and three lines whens = =. 


A quadrupole absorption spectrum can be obtained using a genera- 
tor the frequency of which is continuously varied in the wavelength 
interval under investigation. The resolving power of radiospectro- 
scopic methods is very high. In the case of a spectral line the fre- 
quency of which is of the order of 30 mce/s, the line width is equal to 
several hundred cycles per second. 

The electric quadrupole moment Q is a constant of an atomic nucle- 
us. It describes the deviation from spherical symmetry of the dis- 
tribution of electric charge in a nucleus. The quantity Q (in square 
centimetres) can be determined from the above formula if gat 
is known and the quadrupole frequencies can be measured. The 
deviation from spherical symmetry of the distribution of electric 
charge in a nucleus can be determined to a first approximation by 
representing the nucleus as an ellipsoid of revolution. If the nucleus 
is elongated in the spin direction, then Q > 0; if it is elongated per- 
pendicular to the spin, then Q < 0. 
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An ellipsoidal nucleus tends to become oriented in a very definite 
manner in the field of an electronic cloud. The main energy level 
corresponds to an arrangement in which the axis of symmetry of 
the field and the axis of the ellipsoid coincide. In exciled slates, in 
view of the discreteness of energy, the axis of the ellipsoid can as- 
sume only several selected orientations relative Lo the axis of symmelry 
of the field. The energies of these quantum states is given by the 
above formula. An electromagnetic wave impinging on a molecule 
is absorbed if the magnitude of the photon corresponds to the 
energy of transition from one orientation of the ellipsoid to 
another. 

The study of quadrupole spectra began quite recently. Such spec- 
tra are of great scientific interest as can be gauged by the fact that 
they enable us to measure frequencies with extremely high accuracy 
and that the quadrupole frequency will react to very small changes 
in the electric field of the molecule of which the nucleus is a compo- 
nent as well as of neighbouring molecules. 

Suffice it to say that quadrupole frequencies will differ percepti- 
bly in crystalline varieties of one and the same substance. Thus. 
a nucleus reacts not only to changes in the field due to close elec- 


neta Bnu also te changes in the field due to more distant elec- 


Example. The electric quadru 35 j = 
ac So te nee can pole moment of a nucleus of C155 is Q — 
O00 AO em? Quadrupole resonance occurs in Cl, when v == 


= 54.47 me/s. For a nucleus of Cl5*, the spin s is equal to 4. This means that 


the quantum number m assumes the values = = ae and —$ . Since 
quadrupole ees energy is a function of m2, when an electromagnetic 
energy quantum hv is absorbed only one transition is possible, i.e., a transition 


from the level corresponding to [m|[= = to the level corresponding to 
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By measuring the resonance frequency v and knowing from other data the value 
of the quadrupole moment Q of a Cl®® nucleus, one can determine the gradient 
of the electric field due to electrons at the centre of a Cl®*> nucleus in a Cl» 
molecule: 


OE 
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211. Nueleon Interaction Forces 


Our basic knowledge of nuclear forces can be obtained by studying 
the scattering of particles. It was concluded from Rutherford’s 
firs. experiments in the scattering of a@-particles that nuclear forces 
have a very small range. Rutherford’s results could be explained 
quantilatively by assuming that the deflection of «-particles is due 
lo electric repulsion between charged particles having the same sign. 
The experimental results agreed with the results of theoretical cal- 
culations even when an q@-particle passed extremely closely to the 
scattering nucleus. This means that it suffices to separate two nucle- 
ar particles by a very small distance in order for the effective forces 
to consist only of electric forces; the nuclear’ forces will then no 
longer be effective. 

More direct evidence can be obtained from the scattering of neu- 
trons by protons. For this purpose, a neutron beam is passed through 
hydrogen in a gaseous state. Experiments indicate that only a small 
fraction of the neutrons collide with nuclei of hydrogen atoms. The 
angular distribution of scattered neutrons is uniform. This result 
differs basically from that of q-particle scattering, i.e., scattering 
due to electric interactions. In such scattering, deflection always 
occurs, but the deflection is small when an @-particle passes far from 
a nucleus and large when such a particle passes close to a nucleus. 
It can be concluded from the patterns obtained in the scattering of 
neutrons by protons that the effective range of nuclear forces is very 
small. A: value of the order of 2 X 107-1% cm is reliably deduced from 
such experiments. - 

This is also the value oblained for the effective range from the scat- 
tering of protons by protons. In this case, the experiments and 
calculations are rather intricate since it is necessary to “deduct” that 
portion of the scattering due to purely electric interaction. However, 
the deduction required can be determined using data from observa- 
tions at: high energies and large angles. Unfortunately, direct experi- 
ments in the scattering of neutrons by neutrons are not possible, but 
considerable indirect evidence indicates that in this case too nuclear 
forces have the same properties. For example, let us compare the 
binding energies of tritium (hydrogen isotope of mass 3) and the 
helium isotope of equal mass. In the first case a nucleus consists of 
two neutrons and one proton and in the second of two protons and 
one neutron. It turns out that the binding energy of this helium 
isotope is less than that of tritium by an amount exactly equal to 
the electric interaction between two protons. 

From these experiments one can conclude that the nuclear forces 
acting between nucleons are independent of the electric charges of 
the interacting particles. 


554 Atomic Nuclei 


From experiments in the scattering of nucleons one can also con- 
clude that the interaction is of an exchange nature. The term exchange 
is used to indicate that the colliding particles interchange proper- 
ties, i.e., a proton is transformed into a neutron and vice versa. 
Experimentally, this has been shown to occur in the scattering by 
protons of a beam of neutrons having very high energies (scores of 
times greater than the potential energy of interaction between pro- 
tons and neutrons). One would expect that in such an experiment 
most neutrons pass through hydrogen without scattering, but in 
fact the forward scattered beam consists of equal numbers of 
neutrons and protons. 

The problem of nuclear forces is complicated by the fact that such 
forces depend on nuclear spin orientations. These orientations cause 
nuclear forces to lose their central nature, i.e., the forces do not act 
along the lines joining the centres of particles. 

Many difficulties have not yet been overcome and the nature of 
nuclear forces still remains unexplained. It is quite probable, how- 
ever, that the meson theory of the Japanese physicist Yukawa will 
continue to form the basis of the theory of nuclear forces. This 
theory will be discussed in Sec. 244. 


212. Nucleons in a Nucleus 


Nucleons are very closely spaced in a nucleus. The following for- 
mula for the “radius” of a nucleus is in reasonably close agreement 
with a number of experimental facts: 


R=kyM. 

Here, M is the mass number and k = 1.5 X 10-! cm. The formula 
has been established for heavy nuclei, but it is undoubtedly valid 
for light nuclei as well. Since the radius is proportional to the cube 
root of the number of particles in a nucleus, the volume is propor- 
tional to the first power of this number. Therefore, it must be assumed 
that, at least to a rough approximation, the packing density of 
nucleons in a nucleus is uniform. 

Since nucleons are of a wave nature, one cannot, of course, con- 
struct a geometric model of an atomic nucleus and determine the 
path of nucleons in a nucleus. 

As an approximation, each nucleon can be pictured as moving in 
the field of all the others. Such a nucleon will have a system of 
energy levels which can be filled consecutively in going from light 
nuclei to heavier ones. Like in the case of electrons, the lowest level 
cannot have an angular momentum. In accordance with the Pauli 
exclusion principle, at this lowest level we can have two neutrons 
and two protons (the particles of each pair having opposite spins), 
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i.e., an @-parlicle. An analysis of heavier nuclei shows that stable 
groups of particles will also be located at the other levels. The shell 
model of a nucleus has proved very useful in determining a number 
of nuclear properties and in explaining the prevalence of various 
isolopes. 

It will be noted in examining the composition of atomic nuclei 
that as we proceed from light particles to heavier ones the number 
of neutrons in an atomic nucleus increases more rapidly than the 
number of protons. Thus, in the case of the very stable element lead, 
there are 82 protons and 126 neutrons. This increase in the number 
of neutrons can be explained by the need to counterbalance the 
increased electric repulsion of the protons. 

From another viewpoint, of course, the lack of equality between 
protons and neutrons is disadvantageous. When this equality exist- 
ed, the low energy levels could be filled with a maximum number 
of particles since, in accordance with the Pauli exclusion principle, 
two neutrons and two protons can exist in one state. However, if 
this occurred, the electric repulsion would increase too much and 
the total energy would not be a minimum. Evidently, an actual 
case represents a compromise solution between these two tenden- 
cies. Beta decay phenomena, which occur so frequently in artificial- 
ly radioactive elements, represent the selection of the optimum situa- 
tion in the indicated sense. If there are too many protons or too 
many neutrons in a nucleus, this is corrected by the emission of an 
electron or a positron. 

Like in the case of an atom, a nucleus usually occupies the lowest 
energy level, but it can exist in an excited state. It has been known 
for a long time that radioactive decay is often accompanied by elec- 
tromagnetic radiation. The reason for this is that after emitting an 
a-particle a nucleus remains in an excited state and then passes to 
a low energy level by emitting a photon of very high frequency. 
The latter emission is known as y-radiation. A nucleus can also pass 
into an excited state by colliding with another particle. A study of 
the transition energies of a nucleus enables us to construct its system 
of energy levels. 


213. Interaction of Fast Electrons 


If electrons move slowly, the forces of interaction are determined 
by the arrangement of the charges at the instant ofinteraction.There- 
fore, in the case of slow electrons, the fact that an electromagnetic 
field exists is of no significance, i.e., it is unimportant that the 
interaction is transmitted by means of a field. 5 

The situation is different in the case of fast particles. Here, it is 
necessary to take into account the lag in interaction due to the finite 
velocity of propagation of an electromagnetic field. Interaction at 
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the instant ¢ is determined by the arrangement of the charges al the 
instant t — =. Now, the field cannot be dispensed with. How can the 


quantum nature of a field be taken into account while maintaining 
the particle-field-particle interaction scheme? Such questions are 
dealt with in quantum electrodynamics, a field of theoretical phys- 
ics still being developed. Experimental evidence compels us to 
assume, in considering the interaction of fast particles that such 
interaction consists in the transfer of an energy quantum to the 
field by a particle and the subsequent transfer of this quantum lo 
a second particle. 

If the magnitude of the transferred quantum of energy is equal to 
6 and the time required to transfer this quantum from particle to 
particle is equal to t, then according to the principle of uncertain- 
ty (see p. 479). 


Gt~h. 


If the particles are close to each other, t is small and the particles 

can exchange quanta hv of low and high frequencies. As the distance 
between particles increases, t increases and energy exchange can 
occur only by means of small quanta. Using such reasoning, one 
a derive a formula for the force of interaction between par- 
icles. 

Many authors describe the quantum interaction of particles through 
the medium of a field in the following picturesque terms: In trans- 
ferring a quantum of energy € lo a field, a particle “lends” this ener- 
gy for a short period of time 7. The principle of uncertainty defines 
the dependence of the lending time on the amount of loaned energy. 
It is as if the law of conservation of energy were “violated” for the 
period of time 7, i.e., the lending time. The principle of uncertain- 
ty indicates the permissible time interval during which the law of 
conservation of energy may be “violated” in order for the “violation” 
to be physically meaningless. 

This viewpoint may be developed further, but it leads to the fol- 
lowing difficulty. Since there is no limitation in the absorption and 
radiation of photons, infinitely large changes in the value of a parti- 
cle’s intrinsic energy occur when photons are exchanged. 

It is interesting that this occurs in all approaches to the evalua- 
tion of the intrinsic energy of an electron or other charged particle. 
It was seen on p. 243 that a point particle possesses infinite energy. 
At the same time, it is impermissible to assume that the particle 
has finite dimensions since a perfectly solid particle having finite 
dimensions cannot exist according to the theory of relativity. (The 
existence of perfectly solid bodies is inconsistent with this theory 
since interaction would be propagated instantaneously through such 
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bodies.) But if an electron is a deformable particle, what is its struc- 
ture? A solution to this problem has not yet been found. 

A distinctive feature of the new theory is that, in spite of the fact 
that it is based on contradictions and lacks logical harmony, it 
leads to a number of new, very interesting results. 


214. Meson Theory of Nuclear Interaction 


In the preceding article, we stated that interaction between elec- 
trically charged particles occurs through the medium of an electro- 
magnetic wave by means of quanta. An electrically charged particle 
transmits a quantum of energy to an electromagnetic field and this 
energy is then transmitted to another particle. If it is assumed that 
a field is associated with nuclear forces and that this field is also of 
a quantum nature, interaction between nucleons can be described 
as follows: Each nucleon is surrounded by a field; a nucleon transmits 
a quantum of energy to the field and the field transfers this energy 
to another nucleon. 

A theoretical study to determine whether such an explanation of 
nuclear forces is permissible was undertaken by Yukawa. It turned 
out that a theory could be developed if it is assumed that the field, 
through the medium of which nucleons interact, possesses quanta 
having a rest mass that is not equal to zero. Interaction between 
nucleons was reduced in this manner to an exchange of particles 
having a mass m-=«0. For reasons to be explained below, such parti- 
cles became known as mesons. A meson is a quantum of the mesonic 
field which surrounds nucleons. 

Now, let us examine several conclusions to be drawn from the 
theory. First, we shall try to estimate the range of nuclear 
forces. 

The energy of a meson which is transferred by a nucleon during 
interaction cannot be less than moc?, where mo is-the rest mass of 
the meson. The order of magnitude of the time taken to transfer 


‘ h : ; : 
a meson is nol greater than ——, (on the basis of the relation @t ~h, 
‘0 


where @ is the quantity of transferred energy and 7 is the time taken 
Lo transfer this quantum). Since the velocity of a meson is less than c, 


a meson cannot be transferred over a distance greater than 


mge 
Thus, the value of this constant quantity should give us the range 
of nuclear forces. 

Such is the conclusion drawn by Yukawa. Using the range of nucle- 
ar forces known at that time, Yukawa showed that theoretical and 
experimental results coincide when mp), the rest mass of a meson, 
is 200-300 times greater than the mass of an electron. 
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Positive, negative and neutral mesons have equal validity in 
this theory. Thus, all interactions between nucleons can be easily 
integrated within the framework of one theory. Designating Yuka- 
wa’s meson by x and using a superscript to indicate the sign of the 
meson, one can express the interaction between two neutrons or lwo 
protons as an exchange process involving a neutral meson: 


pwptn;, nw@n+n°. 


An exchange interaction between a proton and a neutron is a proc- 
ess involving a positive or a negative meson: 


pyn+n;, ne pt+qr. 


Yukawa’s theory was developed before the discovery of mesons. 
At present, the above interaction formulas no longer represent theo- 
retical predictions, but rather expressions for phenomena which 
have been actually observed. ; 


215. Mesons 


The term “meson” (from the Greek word “mesos”, meaning average 
or intermediate) was coined to indicate that the mass of such a par- 
ticle lies between that of an electron and that of a proton. Experi- 
ments show that several kinds of mesons exist. Mesons (electrically 
charged) were first detected in cosmic rays. Now, mesons are created 
in accelerators—they arise when nucleons collide. 

However, not all mesons play the same role in interactions between 
nucleons. The Yukawa meson is a m-meson. As has been indicated, 
positive, negative and neutral x-mesons exist. Recent measure- 
ments indicate that the mass of a m-meson is equal to 273 me, 
where me is the mass of an electron. 7 

The mesons which were first found in cosmic rays are 1-mesons 
(positive and negative). The mass of a p-meson is equal to 207 mg. 
Ten years elapsed after the discovery of »-mesons hefore it was shown 
that such mesons are products of m-meson decay. The reason why 
investigators were able to detect p-mesons but were unable to 
detect m-mesons lies in the different lifetime of these particles. The 
average lifetime of a jt-meson is about 10-° sec, but that of a m-meson 
is about a hundredth of this value, i.e., of the order of 10-8 sec. 

Such transformations have been photographed numerous times. 
If we follow along a p-meson track on such a photograph, we may 
find that the velocity of the particle decreases (easily recognised by 
the change in the thickness of the particle track: the slower the par- 
ticle the heavier the track since a slow particle creates more ions 
along its path than a fast one). At the point where the track ends, 
another track begins. Calculations indicate that the new particle 
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is of very high energy, which may be explained only if we assume 
that a part of the rest energy of a a-meson has been transformed into 
kinetic energy. In addition, in order to satisfy the law of conserva- 
tion of momentum, one must assume that a neutral particle of very 
small mass is created when a m-meson is transformed into a p-meson. 
Thus, the neutrino is again encountered. 

The rest mass of a m-meson is of the order of 150 Mev. Therefore. 
i meson may be produced by nuclear bombardment with projec- 
tiles having an energy greater than this value. Actually, an energy 
greater than 300 Mev is required. Nuclear bombardment with high- 
energy particles results in the creation of a large number of mesons. 

Charged and neutral mesons have been created under laboratory 
conditions. In the case of neutral mesons, this is particularly diffi- 
cult since the lifetime of such a particle is 10-'® sec. This means that 
the particle traverses a distance of only a thousandth of a milli- 
metre during its lifetime. 
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When the Schrédinger equation was discussed in Sec. 180, we did 
not use the relations of the theory of relativity since we assumed 
that the velocities of the particles were much less than the velocity 
of light. 

We cannot dispense with the relativistic corrections when dealing 
with high-energy electrons, i.e., electrons having energies of the 
order of millions of electron-volts. Such high-energy electrons are 
encountered in radioactive radiation, in X-ray tubes the operating 
voltages of which are equal to several million volts, in betatrons, etc. 

To correctly describe electrons having such velocities by a wave 
function, one must take into account the relationship between ener- 
gy and momentum given by the theory of relativity. 

If the rest mass of a body is given, the energy and momentum of 
the body can be directly determined from the velocity of the body. 
Therefore, energy and momentum are related. Squaring the expres- 
sions 


é moc 
£2 and p=, 


V1i—p2 Vi-p2 


and then subtracting, we obtain 


This relationship can be written in the following form: 


C=AcV p+ me 
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(see Fig. 242). When p = 0, & = + myc*. This means that the coor- 
dinates of the points at which the two branches of the curve intersect 
the axis represent equal rest energies of the particle. 

For a long time, no attention was paid to the negative branch of 
the curve. It seemed impossible that particles could exist which 
obeyed the laws of this branch. Indeed, a particle having negative rest 
energy and the energy of which decreases as the momentum increases 
must behave very strangely. It is like a particle having negative 
mass, which means that when a force is exerted on such an “electron’ 
the particle is accelerated in the opposite 
direction to that of the force. Thus, if 
we try to altract such a particle, it is 
repelled. Imagine that we have two close 
electrons—an ordinary one and an extra- 
ordinary one, i.e., one which obeys the 
laws of the lower curve. Electrons should 
repel each other. Therefore, the ordinary 
electron will tend to move away from 
the extraordinary one. But since the latter 
“electron” will be attracted under the 
action of the force of repulsion it will 
move toward the ordinary electron. As 


a result, this pair will be accelerated 
jointly, whereby the increase in positive kinetic energy of the 


ordinary electron will be counterbalanced by the increase in 
negative energy of the extraordinary particle. 

Does such a particle exist? First, it should be noted that from the 
viewpoint of quantum mechanics this is not another particle at all. 
The negative branch of the curve of energy as a function of momen- 
tum should be interpreted as giving the lowest energy level of one 
and the same particle, namely, an electron. An electron may be 
located at an ordinary energy level or at a negative energy level. 
Like in the other cases, transition to the lowest energy level should 
be accompanied by energy radiation. 

To be sure, in such a transition the radiation cannot be limited to 
a single photon. This may be explained as follows. When an electron 
passes from an &;, p; state of the upper curve to an &2, pe state of 
the lower curve, the energy released is 


Fig. 242 


E162 = (V pi + mic? + V p+ mic), 


i.e., more than c (p; + po). If this energy were transferred to a sin- 
gle photon, then we should have 6; — G2 = hv and, therefore, py + 


+ po= Lee i.e., the change in the momentum of the photon (gy; is 
Cc 
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anti-parallel to j2) would equal say But 6: — &@2 is greater than 


c (py + pe), not equal to it. In the case of a single photon, it is not 
possible to satisfy simultaneously the law of conservation of energy 
and the law of conservation of momentum, but this is always pos- 
sible if two photons are emitted. : 

We should not be disturbed by the fact that the lower state corre- 
sponds to negative energy and that particles with € < 0 behave very 
strangely. The difficulty lies elsewhere: Since there is limitless room 
at the lower level in view of the fact that it extends to all possible 
values of momentum, why do not all ordinary electrons pass over 
to this level? The proposed model accounts for the existence of only 
“extraordinary” electrons. To accord with this model, an ordinary 
electron should have a short lifetime, much like an excited atom. 
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A relativistic theory of the electron was developed by Dirac in 
1928. The description of an electron in this theory differs considera- 
bly from that in the Schrédinger theory. Four wave functions are used 
to describe the behaviour of an electron since one wave function no 
longer suffices. The existence of electron spin does not follow from 
the Schrédinger theory, but electron spin is a necessary consequence 
of the Dirac theory. The success of this theory in describing numerous 
phenomena speaks for the validity of its basic concepts. Here, we 
shall consider in the light of this theory only the contradiction dis- 
cussed in the preceding article. How can we explain the fact that 
a tremendous number of ordinary electrons exist? How can we 
explain the “reluctance” of these electrons to pass over to the lower, 
negative energy level? 

The Pauli exclusion principle does not allow an electron to pass 
over to a lower level if that level is occupied by two electrons having 
opposite spins. Let us apply this principle to our problem. The solu- 
tion will be obtained if we assume that all negative energy states are 
occupied. Does this mean that all space is completely filled with elec- 
trons having a negative energy state? This conclusion appears ines- 
capable. The Dirac theory has led to a new view of vacuum. In this 
theory, vacuum acquires physical properties, i.e., it is filled with 
electrons in a negative energy state. Moreover, it is filled boundlessly 
since the decrease in energy can extend to negative infinity. 

Let us determine which phenomena can be explained and predict- 
ed by this theory. If all negative energy states are filled, their exist- 
ence may be detected only when an electron passes from a negative 
energy level to a positive energy level after receiving a significant 
portion of energy—in any case, not less than 2 m,c?. Such a process 
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may involve the expenditure of two photons. Another possibility 
is the following: A photon passing close to a heavy atomic nucleus, 
which has a strong electric field, may give up its energy to raise an 
electron from a negative to a positive level. The role of the atomic 
nucleus consists in providing the necessary momentum. 

In both cases an electron is “created”. But in addition to the crea- 

tion of an electron, a “hole” is produced in the negative energy 
states. The removal of an electron, i.e., a negative charge, means that 
. the hole acquires a positive charge of equal magnitude. On the other 
hand, the absence of a particle having negative energy signifies that 
the energy has increased. Therefore, the greater the momentum of 
a hole, the greater its energy. The inescapable conclusion is that 
“holes” behave like positively charged electrons having positive 
energy. Except for the sign of the charge, the behaviour and laws of 
motion of a positive electron (positron) in no way differ from those 
of an ordinary electron. 7 

A positron and an electron are “created” as a pair at the expense 
of the energy of photons. The reverse process, i.e., annihilation, is 
also possible. This consists in the transformation of a colliding elec- 
tron and positron into two photons or, if annihilation occurs close 
to a heavy atomic nucleus, into a single photon. 

It is understandable why positrons have a short lifetime: they are 
attracted by electrons and disappear upon collision. But why do not 
electrons disappear? The reason is simply that there is an excess 
of them. 

Do systems exist in which there are an excess of positrons, i.e., in 
which electrons are unstable? Such a supposition isnot at all ridiculous. 

Pair creation and annihilation are processes which can be easily 
observed in large numbers under laboratory conditions. When 
y-rays having an energy greater than 1 Mev pass through a thin 
metal foil, the electrons which are ejected from the foil will be deflect- 
ed in a magnetic field in opposite directions. By tracing the path of 
a positron (means of observing charged particles are discussed in 
Sec. 204), one can determine the point at which the particle disap- 
pears. This is where the positron combined with an electron. Using 
modern photon counter devices, one can show that two oppositely 


directed photons, each having an energy of the order of ue , simul- 
taneously emerge from this spot. 


218. Particles and Anti-Particles 


There is no reason for supposing that the existence of the posi- 
tron—it would be better to call it an anti-electron—is due to a pecul- 
iar feature of small particles. In spite of the distinctive features of 
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the theory of interaction between nucleons, it is basically similar 
to the theory of interaction between electrons. In most theoretical 
studies, nucleons are considered to be described by equations which 
are quite similar to the Dirac equations for electrons. It was to be 
expected, therefore, that nucleons have anti-particles which stand 
in the same relationship to the proton and neutron as a positron does: 
to an electron. The first of such anti-nucleons to be discovered was. 
the anti-proton. Somewhat later the anti-neutron, the magnetic 
moment orientation of which differs from that of the neutron, was 
discovered. (The magnetic moment and the angular momentum 
vector are antiparallel in the case of the neutron and; parallel in 
the case of the anti-neutron.) 

The discovery of the anti-proton proved that the concept of an 
inseparable link between field and particles is well founded. Like in 
the case of a positron-electron pair, a proton-antiproton pair may be 
created by the passage of a particle (a nucleon) from a negative ener- 
gy state to a positive energy state. For this purpose, an energy of at 
least 2//c? is required. This is a tremendous amount of energy— 
1,840 times the energy required to create an electron-positron pair. 
An accelerator which could accelerate particles to billions of elec- 
tronvolts had to be constructed before it was possible to discover 
the anti-proton. 

If a proton and an anti-proton collide, they will be annihilated. 
Since nucleons transfer energy through the medium of a meson field, 
when they are annihilated their mass and energy are given up to 
quanta of this field, i.e., to mesons. This process will undoubtedly 
be subjected to careful study in the years ahead. 

Fig. 243 shows a photograph of the annihilation of a proton and 
an anti-proton. This process occurred in a bubble chamber filled 
with liquid propane. A sketch of the process is given in the upper 
left-hand corner. 

The reasoning used in explaining why anti-particles must exist 
extends to the anti-neutrino as well. An anti-neutrino is the “mirror” 
image of a neutrino. The difference between the particles composing 
this doublet is the same as in the case of the neutron-antineutron 
doublet. 

Mu mesons and other elementary particles which have not been 
discussed are also encountered as doublets. © 

Pi-mesons constitute a triplet: thére is a m-meson having a posi- 
tive charge, another having a negative charge and still another having 
zero charge. In contradistinction to the neutron and the neutrino, 
the neutral m-meson has no spin and, therefore, can have no anti- 
particle (or expressed differently: the neutral m-meson coincides 
with its anti-particle). The photon is another particle which has no 
“mirror image”. 
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Fig. 243 
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219. Asymmetry of Elementary Particles 


Nucleon-nucleon interactions occur with the emission and absorp- 
tion of m-mesons. This is the strongest of the various interactions 
between elementary particles and is responsible for the force binding 
nucleons in an atomic nucleus; its duration is 10-®° sec. Nuclear 
forces are about a hundred times as powerful as electromagnetic 
forces; the duration of electromagnetic interaction, which occurs. 
by the exchange of photons, is 10-74 sec. These two types of interac- 
tion are known as strong interactions, in contradistinction to the 
weak interactions occurring in particle transformations in which 
neutrinos take part. Examples of weak interactions are the transfor- 
mation of a neutron into a proton, accompanied by the emission of 
a neutrino and an electron (see B-decay), and the decay of n- and 
-mesons: 

m—>p+v and p—-e+v+v. 


The duration of weak interactions is of the order of 10-® sec. Nuclear 
forces are about 10“ times as powerful as weak interaction forces. 
In calculating the interaction force from the duration of the process, 
we are assuming, of course, that other conditions remain equal. 

Some. extremely interesting discoveries have been made recently 
in connection with weak interactions. It has been found that weak 
processes exhibit “left-right” asymmetry. Thus, for example, in 
B-decay of cobalt nuclei polarised at low temperatures by means of 
a magnetic field (polarisation of the particles orients their magnetic 
moments and spins in a definite direction), the angular distribution 
of electrons is asymmetrical with respect to the “forward” and “back- 
ward” directions. Similarly, in -meson decay, asymmetry with re- 
spect to the direction of motion of the particles has been detected. 

A theory for this phenomenon was proposed by Lee and Yang, and 
by the Soviet physicist Landau. Two explanations are possible: 
either the particles are internally asymmetrical or space is asymmet- 
rical. We shall restrict ourselves to the first explanation. Its essence 
lies in the assumption that elementary particles are like screws as 
regards their symmetry properties. Such asymmetrical particles are 
well known to the physicist. They include, for example, the left and 
right optical antipodes of molecules (see p. 441). It is clear that an 
asymmetrical clementary particle the axis of which is randomly 
oriented has different properties in the “forward” and “backward” 
directions. This has been proved oxpcrimentally, 

In order to explain the asymmetry observed in experiments with 
beams of p-mesons, one can use the Landau hypothesis, which re- 
lates the asymmetry of a particle to its charge. As indicated in the 
preceding article, all particles with the exception of the photon, are 
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encountered in nature as charged pairs. Landau proposed that if 
the symmetry properties of a particle are like those of a right-hand- 
ed screw, the properties of its anti-particle are like those of a left- 
handed screw. Reflection in a mirror makes a right hand appear like 
a left hand and a right-handed screw like a left-handed screw. Thus, 
the “mirror image” of a particle is its anti-particle. 

What application has this hypothesis to experiments with beams 
of y-mesons? It can be proved that a particle having no mass must 
be oriented with its spin in the direction of motion. The mass of 
a neutrino is evidently equal to zero. Therefore, neutrinos are “lon- 
gitudinally polarised”. The difference between a neutrino and an 
anti-neutrino can be reduced to the following: the spin orientation 
of a neutrino is in its direction of motion, while that of an anti- 
neutrino is in the opposite direction. Mu mesons are formed when 
st-mesons decay. Since the spin of a t-meson is equal to zero, the spin 
of a w-meson must be parallel to the spin of a neutrino. Therefore, 
the p-mesons formed in a m-meson beam are longitudinal. This 
explains the asymmetry observed in the distribution of electrons 
during thefsubsequent decay of these u-mesons. 

Investigation ofjthe asymmetry of elementary particles may result 
in a significant revision of a number of fundamental concepts in 
physics. 


CHAPTER XXXI 
NUCLEAR TRANSFORMATIONS 


220. General Laws of Chemical and Nuclear Transformations 


Now, we shall discuss several energy relations which are equally 
applicable to chemical reactions and to transformations of atomic 
nuclei and other particles. 

Transformations can occur only when particles closely approach 
each other. Since particles must possess a certain kinetic energy in 
order for a transformation to occur we are 
quite justified in using the term “colli- U 
sion” to describe a close approach between 
particles. Not every encounter between 
particles results in a transformation. The 
mechanism of chemical and nuclear trans- 
formations is very difficult to study. 
Since direct observations are impossible, 
one is forced to make hypotheses the 
validity of which can be checked indi- 
rectly. 

In the case of chemical transformations, ; 
the mutual orientation of molecules upon Fig. 244 
- collision undoubtedly plays an important 
role. In order for a reaction to occur, molecules must approach 
each other in a manner suitable for the regrouping of atoms. 

For every transformation which can occur on a mass scale—the 
usual case in chemical and nuclear reactions where billions of mole- 
cules or nuclei collide during a short interval of time—one can 
indicate, in principle, the number A which, generally speaking, will 
be a measure of the fraction of encounters in which particles are in 
a position “suitable” for a transformation to occur. 

However, the requirement of appropriate orientation is, of course, 
not the only condition to be met in order for a transformation to 
occur. Since a particle is ordinarily stable and, hence, possesses 
a minimum of potential energy, an energy sufficient to lift the mole- 
cule out of its potential well must be imparted to it. This minimum 
necessary energy is called the activation energy. Fig. 244 shows a poten- 
tial energy curve. The particle is stable when r = 0, where r is 
a fixed parameter. In order for a reaction to occur, an activation 
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energy @ must be provided. For the case! represented in the figure, 
the reaction proceeds with a liberation of heat. 

We can apply the Boltzmann law to collisions between molecules, 
or between nuclei (under similar conditions), i.c., we can assume 
that the number of encounters resulting in a transformation is pro- 
portional to e-@/kT, where @ is the activation energy. 

It is evident that the rate of transformation may be expressed as 
the product Ae~G/:T, where the first factor takes into account the “geo- 
metric” conditions of encounter and the second the energy aspect. 

It is customary to give special consideration to the case of two 
colliding particles at the instant when the potential energy is a max1- 
mum. Such an activated complex (as it is referred to in chemistry) 
or compound nucleus (as it is re- 
ferred to in the study of nuclear 
transformations) does not exist 
for a very long period of time. 
The system may “slip back” into 
the potential well or “roll over’ 
the well wall. In the latter case, 
a transformation has occurred 
and a new system with a new 
Absorbed potential energy has been 

heat formed. 

In chemical as well as nuclear 
transformations, the resulting 
system may consist of a single 

Fig. 245 new particle (addition reaction) 
or two new particles. 

If the potential energy of the new particles is greater than the 
potential energy of the original particles (cf: the bottom of a vol- 
cano’s crater is below the level of the foot of the mountain on which 
the volcano is located), the transformation proceeds with the absorp- 
tion of energy. The absorbed energy (heat) will be equal to the differ- 
ence between the activation energy and the energy of the reaction 
products (see Fig. 245). If the energy of the created particles is less 
than the energy of the original particles, heat is liberated. 

Both kinds of transformations are encountered in chemistry and 
in nuclear physics. Reactions proceeding with a liberation of heat 
are called exothermic, while those proceeding with an absorption 
of heat are called endothermic. 

Chemical and nuclear transformations are often accompanied by 
radiation. However, as a rule, the main energy effect of a reaction 
consists in the transformation of the potential energy of the atoms in 
a molecule (or nucleons in a nucleus) into kinetic energy. Therefore, 
generally speaking, transformations in which heat is liberated are 
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those in which two slow particles collide and produce two fast ones. 
Of course, in endothermic reactions, the opposite takes place. 

It is seen from the formula that the rate of transformation increases 
exponentially with temperature. This is the reason why chemical 
transformations are extremely sensitive to changes in temperature. 
The higher the temperature, the greater the impact of colliding par- 
ticles. It is well known that temperature plays an important role in 
chemical transformations. In nuclear transformations, as a result of 
the tremendous values of binding energy, the role of temperature 
change is not so noticeable. The activation energy of atomic nuclei 
has an order of magnitude of several Mev, but if, for example, the 
temperature is increased to 3,000°, the energy of an atomic nucleus 
increases by only 0.4 ev. 

The temperature must be increased to millions of degrees rather 
than to thousands if we wish to accelerate nuclear transformations 


(see below). 
221. Nuclear Reactions 


A mass of experimental data has been accumulated on nuclear 
reactions. The number of such reactions which have been studied 
reaches several thousand. 

At present, the following types of nuclear reactions are known 
(in addition to radioactive decay, which may be viewed as a nuclear 
decomposition reaction): capture reactions, in which two colliding 
particles combine; exchange reactions, in which a particle is cap- 
tured and another is ejected; and fission reactions, in which a nucleus 
breaks up as the result of energy received in one or another form. 
Nuclear reactions which occur under the action of hard y-rays are 
known as photonuclear reactions. 

By means of nuclear reactions, one can obtain stable natural iso- 
topes as well as unstable radioactive isotopes which are not encoun- 
tered in nature. It has proved possible to synthesise elements which 
have no stable isotopes (for example, technetium, an element having 
the atomic number 43), and also transuranic elements. 

The reactions which occur when various nuclei are bombarded with 
a-particles, protons and neutrons have been studied most care- 
fully. 

When an a@-particle collides with a nucleus, one of two types of. 
reactions generally occurs: either the a-particle is captured and a pro- 
ton (p) is ejected or the a-particle is captured and a neutron (n) is 
ejected. These reactions are designated by the symbols (a, p) and 
(a, m), respectively. 

The equation for an (a, p) reaction has the form 


2EL™ + get —> gE *? + yp? 
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The equation for an («, n) reaction has the form 
zEl™ + oc —> pypB lt? + on}, 
Here are some examples of a-particle reactions: 
(&, p) :7N™ + 2a4 —> 402? + yp}; 
13A 1°? + 9004 —> 4,519 + yp; 
(a, 2): ,Be® + .a4—> ,Cl2 + yn; 
5310+ oat —> 7N8 + gn. 


The first of the above (a, m) reactions is of great practical impor- 
tance since a mixture of radium (a-particle source) and beryllium 
is a common neutron source. 

A large class of reactions occur as the result of collisions with pro- 
tons. Such reactions include (p, a) reactions (in which a proton is 
captured and an a-particle is ejected), e.g., 


E19 + ,pt—> 018 + ,Het, 


and a m) reactions (in which the ejected particles are neu- 
trons). 


Capture reactions which are not accompanied by the ejection of 
a particle also occur. The excess energy is released in the form of 
y-rays. Therefore, such reactions are designated as (p, ) reactions, 
€.2., 
3Li’ + 1p — > ,Be§, 


Reactions involving deuterons (d), e.g., (d, p) and (d, n) reactions, 
have been subjected to careful study. When heavy water (deuterons) 
is bombarded with deuterons, a radioactive isotope of hydrogen, 
viz., tritium, may be formed: 


,D? + ,D? —> ,H? + ,p'. 
However, a (d, n) reaction may also occur: 
1D? + ,D? aaa 2He?® + git. 
Reactions involving neutrons are of great importance in nuclear 
engineering since they occur abundantly in nuclear reactors. Such 
reactions include (n, a), (n, p), (m, 2) and (n, y) reactions. In addi- 
tion, reactions involving the fission of heavy nuclei occur under 
the action of neutrons (see below). 
Two kinds of reactions occur between nitrogen and neutrons: 
(n, p) : N44 + gn? —> 6C4 + yp; 
(12, 2) : 7N14 + gn! — {N18 + 2on!. 
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The first of these reactions yields a carbon isotope of long lifetime 
(more than 5,000 years). This isotope is of great importance in bio- 
chemical investigations. 

Almost all isotopes are capable of capturing a neutron [(n, y) reac- 
tion]. In this manner, an isotope of the same element, the mass of 
which is one unit greater than the original isotope, is formed. Usual- 
ly radioactive isotopes (B-radioactivity) are produced. 

Exothermic and endothermic reactions are possible in nuclear 
chemistry, like in molecular chemistry. The magnitude and sign 
of the thermal effect can be determined using the principle of the 
equivalence of mass and energy. 

If nuclei of masses /7, and M, are formed from nuclei of masses 
M, and Mz, then 

M,+M,=M;+M,+Am. 


When Am > 0, i.e., when the nuclei of the reaction products have 
less mass than the original nuclei, the reaction is exothermic. When 
Am < 0, i.e., when the nuclei of the reaction products have greater 
mass than the original nuclei, the reaction is endothermic. The 
energy released or absorbed during a reaction can be determined 
from the formula ¢ = c?Am. Calculated and experimental results 
are in perfect agreement in all cases. 

In most nuclear reactions, the thermal effects are of the order of 
millions of electron-volts for each pair of reacting nuclei. This is 
millions of times greater than the corresponding values in chemical 
reactions. 


Sample calculation, Let us consider the reaction 
,Be® + yp? —> Li’ +- 204, 


The following are tabulated handbook values of the masses occurring in this 
reaction: mpe = 9.01503, mp = 1.00812, mpi = 6.01697 and mg = 4.00390. 
The reaction is exothermic since Am = 0.00228, i.c., Am > 0. Since an atomic 
weight unit corresponds to 931.8 Mev, the thermal effect is equal to 0.00228 x 
X 931.8 = 2.42 Mev or about 8 X 107 calories (1 Mev = 3.827 x 
x 10-14 calories). 


222. Fission Reactions of Heavy Nuclei 


Atomic nuclei which are heavier than tin nuclei are capable of 
splitting into two parts of approximately equal mass. In order for 
this fission to occur, a considerable amount of activation energy 
must be imparted to such a nucleus (the greater the isotopic mass 
number the smaller the activation energy). For nuclei of uranium, 
thorium and palladium, this energy is equal to about 5 Mev. The 
fission reactions of such nuclei are exothermic; the energy released 
considerably exceeds the activation energy. 
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Fission may be caused by protons, deuterons, a-particles or y-ra- 
diation. However, fission caused by a neutron hitting a heavy 
nucleus is of paramount significance. 
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Let us consider fission of uranium-235 under the action of neu- 
trons. When a neutron is captured, a uranium isotope with a mass 
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‘one unit greater than U®%5 is formed: 
92238 +. 91 —> 9p U238, 


Fission occurs if the activation energy of U*%* is less than the energy 
supplied by the neutron. The captured neutron brings to the nucleus 
an energy equal to the binding energy of the neutron plus the kinet- 
ic energy of its motion. In the case of U***, the energy supplied 
by a neutron of thermal velocity is sufficient to produce fission. 

Since a large amount of energy is required to split a nucleus of 
U238, the nucleus of this isotope can be split only by means of fast 
neutrons. 

A nucleus of U™* generally divides into two unequal fragments. 
Nuclei split randomly and yield various primary products. Fig. 246 
shows the yield of uranium-235 fission products as a function of 
mass. It is seen from the curve that division into equal fragments 
has a minimum probability. Most frequently the masses of the frag- 
ments stand in a ratio of about 2 to 3 (e.g., Sr® and Xe?%). These 
nuclei have enormous energies—a light nucleus has an energy of 
about 100 Mev and a heavy nucleus 65 Mev. The large variety of 
fission products is due not only to the fact that a nucleus.may split 
in numerous ways, but also to the fact that a number of other pro- 
ducts are created when radioactive fragment nuclei decay. 

Nuclear fission products have an abnormally large number of 
neutrons. By means of a chain of transformations, radioactive nuclei 
assume a normal stable state. For example, 


— 55829? —_—_-— seals”, 
in 33 years 


sal!” 30 sec oxen 3.4 mi 
The half-life is indicated under the arrows. It is evident from the 
values of decay time that, practically speaking, only Cs18’ is obtained 
as the product of this chain of transformations. 

Another very important fission product of uranium-235 is stron- 
tium-90 (see below). 

Nuclear fission of uranium-235 is accompanied by the release of 
an enormous amount of energy. Thus, 1 gram of uranium yields as 
much energy as the combustion of 2.5 tons of coal, i.e., 22,000 kwhr. 
Most of the energy is released in the form of kinetic energy of fission 
fragments and about 10% is released in the form of radiation. 
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All aspects of the uranium-235 reaction have been studied in 
detail. Uranium-235 is the only natural isotope from which energy 
can be obtained on an industrial scale. 

Since neutrons are required to achieve fission and a neutron gas 
does not exist in nature, a large amount of energy can be obtained 
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only by means of a chain reaction in which new neutrons are conlin- 
uously created. This is precisely the method used to achieve nuclear 
fission of uranium-235. Every time a U-235 nucleus is split several 
neutrons are liberated. The number of such neutrons varies from 
case to case, but on the average 2.5 neutrons per fission are obtained. 
If the neutrons formed when a nucleus splits are able lo cause 
fission of other uranium atoms, a chain reaction may occur. 

Since neutrons’ have a long range, the neutrons formed upon /is- 
sion of a uranium-236 nucleus have a large probability of escaping 
from the substance without splitting other nuclei. Moreover, it 
should be realised that not every encounter between a neutron and 
a uranium-235 nucleus results in fission. 

The growth of a chain reaction is a function of the neutron mul- 
tiplication factor. The value of this quantity (Ko) can be calculated 
for the case of a system having infinite dimensions. To perform this 
calculation, one must know the neutron multiplication due to fission 
by slow and fast neutrons and also the probability of neutron cap- 
ture by nuclei of nonfissionable materials. 

_ The extent to which Ky exceeds unity is a measure of the increase 
in the number of neutrons of a given generation over the preceding 
generation. 

However, a reactor has finite dimensions. Therefore, the neutron 
multiplication factor must be written in the form K = Ko (4 — p), 
where p is the probability that a neutron will escape from the reactor. 
nm order for the reactor to operate, Kg must be greater than unity. 

uring reactor operation, the multiplication factor K must be 
exactly equal to unity. : 

The dimensions of a system containing nuclear fuel are said to 
be critical when the multiplication factor of the system is equal 
to unity. 

Let us examine the factors which affect K. The probability of 
a neutron colliding with another nucleus before escaping from the 
substance may be increased by amassing a large quantity of nuclear 
fuel. The same end is achieved by reducing to a minimum the num- 
ber of atomic nuclei capable of absorbing neutrons, since such 
absorption removes neutrons from the reaction. The probability of 
capture may be increased by slowing down the neutrons. (When 
a nucleus splits, fast neutrons are formed, but a uranium-235 nu- 
cleus is best able to capture slow neutrons, i.e., so-called thermal 
neutrons.) 

Nuclei of uranium-235 may be split by fast neutrons as well as 
by slow ones, but the probability of being split by the former is 
less. If the nuclei of a substance may be split only by fast neutrons, 
it becomes impossible to produce a chain reaction: a neutron created 
during fission of a nucleus and slowed down even slightly as the 
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result of one or two chance encounters which do not result in fission 
will be lost to the chain reaction. ; 

At present, the most important nuclear fuels are able to produce 
a chain reaction with slow neutrons. Such fuels include one natural 
isotope (uranium-235) and two artificial elements, viz., plutonium- 
239 (obtained from uranium-238) and uranium-233 (obtained from 
thorium-232). 

The quantity of nuclear fuel in a particular volume must exceed 
a certain minimum in order for a chain reaction to begin. We need 
not be concerned about the first neutron since, thanks to cosmic 
radiation, a small number of neutrons are always present in the 
atmosphere. Moreover, spontaneous fission, i.e., fission due to inter- 
nal forces, also occurs occasionally. (This phenomenon was discov- 
ered by the Soviet scientists Flerov and Petrzhak. They found 
that it is not always necessary for a neutron to be captured in order 
for fission of a uranium-235 nucleus to occur.) Finally, a mixture 
of radium and beryllium may serve as a source of initial neutrons. 


224. Principle of Operation of a Nuclear Reactor 


If a chain reaction begins in a given mass of nuclear fuel and if 
the reaction is uncontrolled, the result will be an explosion since at 
each instant the number of neutrons, and hence the quantity of 
released energy, will be greater than at a preceding instant. The 
quantity of energy released in a small fraction of a second will be 
so great that an explosion results. 

In order to release a constant or controlled amount of energy, one 
must build an installation in which the neutron multiplication 
factor can be controlled. An installation which allows us to do this 
is called a nuclear reactor or pile. In such an installation, it must 
be possible to begin a chain reaction with a multiplication factor 
slightly greater than unity. Then, the concentration of neutrons 
inside the pile, and hence the power of the reactor, will begin to 
increase. After raising the power to the desired level, one must be 
able to set the multiplication factor exactly equal to unity. The 
reaction then becomes self-sustaining since the number of neutrons. 
and the energy released per unit time remain constant. 

A reactor should be constructed in such a manner that the neutrons 
formed upon fission are utilised most effectively. But effective 
utilisation of neutrons does not mean that they must be used exclu- 
sively for the fission of nuclei and the release of energy. Substances 
the nuclei of which absorb neutrons may be introduced into the 
pile. By means of reactions with neutrons, we can obtain large 
quantities of useful artificial radioactive isotopes and, what is 
extremely important, artificial nuclear fuel. Thus, a nuclear reactor 


ee 
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may be used not only to produce energy, but to produce artificial 
isotopes as well. 

Neutrons formed upon fission have a velocity of tens of thousands 
of kilometres per second. The velocity of thermal neutrons is of the 
order of 1 km/sec (0.025 ev). Such slow neutrons are most effective 
in producing fission. 

Nuclear reactors operating with a natural or enriched mixture of 
uranium-235 and uranium-238 are of great importance. 

Resonance absorption of neutrons will occur in uranium-238. 
Strong absorption occurs, for example, at an energy of 7 ev. When 
a reactor is loaded with a mixture of isotopes, it is absolutely essen- 
tial to decelerate neutrons below this energy value. 

Thus, the basic elements of a reactor are fuel, a neutron modera- 
tor, a neutron absorber to control the multiplication factor, and 
shielding to protect personnel from neutrons and y-radiation emitted 
during nuclear transformations occurring in the reactor. 

The small RFT uranium-graphite reactor of the U.S.S.R. Acade- 

my of Sciences is a good example of a well-designed reactor. This 
reactor is used for various physical investigations and for the pro- 
duction of artificial isotopes. Its power is 10,000 kw and the flux of 
thermal neutrons at the centre of the reactor is equal to 8 x 101% neu- 
trons/cm? sec. The core is in the form of a cylinder the diameter 
and height of which are equal to'1 metre. The cooling water and 
the graphite serve as a neutron moderator. A number of horizontal 
and ‘vertical holes, through which beams of neutrons emerge, pass 
through the reactor shielding. Materials to be irradiated are placed 
in the reactor core through special apertures provided for this 
purpose. 
JA considerable number of nuclear reactors have already been put 
into operation or are in the design stage. Such reactors may differ 
from one another with respect to: 1) the type of fuel used (pure 
nuclear fuel, enriched fuel, or natural uranium in the metal form or 
in the form of a chemical compound); 2) the fuel pattern (space 
lattice, rod lattice, or uniform distribution of the fuel in a solution 
or suspension); 3) the moderator (light or heavy water, graphite or 
beryllium); and 4) the type of cooling (water, gas or no cooling). 
Reactors may be designed to have any power ranging from a fraction 
of a kilowatt to hundreds of thousands of kilowatts and may operate 
with slow (thermal) or fast neutrons, depending on the type and 
quantity of moderator used. 

Reactors are designed to be controlled completely automatically. 
Monitoring is achieved by means of neutron detectors located in the 
reactor walls. Such detectors are able to measure neutron fluxes 
ranging from 1 neutron/cm? sec to 5 X 101° neutron/cm? sec. The 
neutron-sensitive material of a detector may consist of boron-10 or 
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uranium-235 (fused to the electrodes of an ionisation. chamber) or 
boron fluoride gas (i.e., the ionisation chamber is filled with BF3 
gas). In the former case, the chamber is filled with argon, nitrogen, 
helium or air al a pressure of 2 atmospheres. 

The: operation of the ionisation-current amplifiers in the various 
relay mechanisms which transmit the neutron detector signal to the 
actuators controlling the motion of the control rods and safety rods 
must be extremely reliable. 

The position of the control rods varies during reactor operation. 
This is because the quantily of material which absorbs neutrons 
increases as the decay producls accumulate. Some of this “poisonous” 
material, e.g., gaseous products, may be removed from the pile 
automatically. 

Nevertheless, gradual withdrawal of the control rods is necessary 
‘o maintain the neutron density at a constant level. After a period 
of time of the order of 5-20 months the reactor becomes so “poisoned” 
that further operation becomes impossible. The pile must be cleansed 
of decay products and reloaded with new fuel. Fission products 
which are strong absorbers of neutrons include ruthenium-103, 
xenon-131 and 135, neodymium-143, samarium-149 and 151, 
europium-151, 152 and 155 and gadolinium-195. 

Of course, the formation of artificial fuel also causes the mullti- 
plication factor to change. 

If in addition to nuclear fuel there is a quantity of uranium-238 
or thorium in a reactor, such a reactor will not only liberate heat 
but will produce artificial nuclear fuel—plutonium from uranium- 
238 and uranium-233 from thorium. 

A reactor in which the quantity of fissionable material increases, 
or at least remains unchanged, is called a breeder. In constructing 
such a reactor, one must use a fissionable material which allows us 
to obtain an average number of neutrons per fission of more than 
two. The condition for fuel reproduction is that the number of atoms 
splitting each second be no less than the number of uranium-238 
or thorium atoms transformed each second into plutonium or ura- 
nium-233, respectively. 

In order to transform the liberated heat into electrical energy, 
one must proceed exactly in the same way as in the case of thermal 
electric power plants operating on coal. Heat extraction from a reac- 
tor is a very difficult engineering problem. However, this problem 
has been solved. The first atomic electric power station has been 
Operating in the U.S.S.R. for several years and one has also been 
put into operation in Great Britain. During the next few years, 
a number of large atomic electric power stations will be constructed 
in the Soviet Union. 
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225. Artificial Radioactive Products 


Nuclear reactors constitute the most prolific source of radioactive 
products. Every hour of reactor operation yields a definite quantily 
of products of nuclear fission. This process cannot be controlled so 
as to obtain desired products. “Common” fission products having 
a relatively large (sufficient for practical utilisation) decay period 
include: 


Kr, Sr®, Sr, [229 , [131 Xels3 | Cs137 and Bal!9, 


With the further development of nuclear engineering, the quan- 
tity of radioactive fission products (fragments) obtained in reactors 
will reach imposing values. A reactor having a power of 1 kw yields 
a quantity of products which radiates 10° curies in 100 days. At 
present, by no means all of them have practical applications. O! 
most interest are caesium-137 and strontium-90. These have a rela- 
tively long lifetime and are obtained in reactors in relatively large 
quantities. The atomic industry produces them in pieces with an 
initial radioactivity of the order of 1,000-2,000 curies. This is equiva- 
lent to the radioactivity of 1-2 kg of radium. 

Radioactive fission products are used in research work, produc- 
tion (process and quality control), medical treatment, etc. Some 
radioactive fission products are used to sterilise food products, 
antibiotics, vaccines, etc. Such sterilisation requires a very: large 
dose. This is obtained from radioactive preparations yielding 
a thousand curies over a very short period of time—of the order of 
minutes or hours. The radioactivation of irradiated objects is not 
dangerous since fission products radiate B- and y-rays and these do 
not produce appreciable radioactivity in an irradiated substance. 
If packaged goods or goods protected by a natural envelope, e.g., 
eggs or fruit, are subjected to y-ray sterilisation, the bacteria are 
destroyed and new bacteria cannot penetrate through the packing 
or envelope. Large-scale sterilisation by radioactive irradiation may 
replace refrigeration in many cases. 

An important field of application of radioactive isotopes is ra- 
diography. Cobalt-60* and caesium-137 replace X-ray machines in 
the flaw detection of metals. The use of y-rays instead of X-rays 
means that an X-ray tube and its high-voltage equipment are replaced 
by cheap fission products: the basic equipment of a radiographic 
laboratory may be reduced to an ampule of radioactive matter and 
a fluorescent screen or photoplate. Radiography, moreover, has 
broader applications. For example, the inside of a pipe may be 
examined with y-rays, something that is physically impossible to 


* Cobalt-60, which is not a fission product, will be discussed later. 
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do with X-rays because of the size of an X-ray tube. Artificial iso- 
lopes may replace radium salts used to paint the dials of clocks and 
other instruments. 

By means of strontium-90, air may be ionised to eliminate static 
electricity, the accumulation” of which is undesirable in various 
industries. It is useful to ionise the air in combustion engine 
chambers since this increases the rate of combustion. 

By means of strong f-radiators, one can construct low-power 
sources of electric current (less than 1 milliwatt). There are several 
ways of creating electrical devices by means of streams of electrons 
from B-radiators. One of the most promising methods is the irradia- 
tion of germanium or silicon with electrons. Layers possessing uni- 
directional electrical conductivity may be manufactured from these 
elements. The distinguishing feature of semiconductors of the ger- 
manium or silicon type is that under the action of external electrons 
a large number of electrons are set free inside these substances. As 
a result of unidirectional displacement of the liberated electrons, 
a potential difference of the order of 0.2-0.3 volt is created across 
a semiconductor pair. The efficiency of such an electrical device can 
be gleaned from the fact that one external electron produces up to 
200,000 internal electrons. Therefore, in spite of the fact that the 
efficiency in the utilisation of radioactive radiation is only 1%, a 
source: of 50 millicuries with a half-life of 20 years (strontium-90) 
enables us to produce a low-power generator of electrical energy 
at an insignificant cost. 

A specific isotope may be obtained in two ways: by means of 
a nuclear reactor or by nuclear bombardment in an accelerator. 

When we place a substance in a nuclear reactor, it is subjected to 
the action of neutrons. Since most elements easily react with neu- 
trons, it is possible to obtain radioactive isotopes of almost all the 
chemical elements. As a result, nuclear reactors constitute the main 
source of radioisotopes. 

Irradiation with neutrons is accomplished as follows. The substance 
to be irradiated is placed in an aluminium capsule the length of 
which is about 7 cm, diameter about 2 cm, and wall thickness 0.41- 
0.2 mm. If the substance is volatile or may react with aluminium, 
it is placed in a quartz ampule, which is wrapped with quartz fibre 
before being inserted into the aluminium capsule serving to protect 
it mechanically. Ordinary glass cannot be used since it is a strong 
absorber of neutrons. If a large object is to be subjected to irradia- 
tion, a big aluminium box must be constructed for it. Reactors 
designed for the production of radioisotopes are provided with 
tunnels of various cross-sections, through which aluminium boxes 
and capsules may be introduced into the reactor to a great depth. 
After irradiation, the boxes and capsules are placed in lead containers. 
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Radioisotopes are formed in a reactor as the result of neutron 
capture or the dislodgment from the bombarded nuclei of protons, 
and, less frequently, a-particles, by neutrons. At first the number of 
radioactive atoms in an irradiated substance increases uniformly 
with irradiation time. Then, the rate of increase in activily decreases 
until, finally, saturation sets in. For different substances, this 
process proceeds at different rales, depending on the half-life of the 
element. The rate of increase in activily is uniform as long as the 
half-life is 5-10 times greater than the irradiation time. Saturation 
sets in when the irradiation time becomes approximately 5-10 times 
greater than the half-life. 

It should be noted that a nuclear reactor cannot produce the va- 
riety of isotopes produced by a cyclotron. This is quite understand- 
able since in a reactor the conditions of bombardment of atomic 
nuclei are restricted by the nature of the bombarding particle, i.e., 
the neutron, and its range of energies. But there are even objections 
to the use of reactors to obtain large quantities of an isotope such as 
C“, The initial material introduced into a reactor to obtain C™ is 
an excellent absorber of neutrons and therefore cannot be used in 
large quantities. Thus, accelerators have a definite role of their 
own to play in the production of radioisotopes. 

The effectiveness of an accelerator is determined by the energy 
and number of nuclei ejected from the accelerator per unit time. 
he number of nuclei can be easily calculated from the average ion 
current, which, in a cyclotron, is equal to 10-1! a. Knowing the 
charge of an ion, one can easily determine that 10° nuclei, or 


2X 10-2 gm in the case of deuterium, are obtained each 
second. : 


In addition to the above facts, one must also know the “effective 
cross-section” of the reaction in order to determine the rate at which 
a particular radioactive substance will be formed under the action 
of a beam of nuclei. This quantity is designated by o and its value 
is given in cm?. It has the following significance. If S is the area of 


a bombarded sample, -~ is the probability that the nuclear projectile 


will hit the target nucleus and produce the given reaction. The order 
of magnitude of o is usually about 10-°* cm?. In the case of certain 
nuclei, the absorption of neutrons increases sharply at definite 
neutron velocities. For example, cadmium is a strong absorber of 
slow neutrons with an energy of 0.18 ev. The effective cross-section 
of this reaction is of the order of 7,000 x 10-24 cm?. 

The cross-section of a given reaction depends to a great extent 
on the energy of the bombarding particle. If a sharp peak occurs 
in a curve of cross-section plotted against incident particle energy, 
the peak value is referred to as the resonance cross-section. 
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It is evident from the values given above that the production of 
significant quantities of isotopes often requires many weeks of 
accelerator operation. 

The radioactive elements obtained from reactors and accelerators 
are widely employed as tracers (whence the designation “tagged 
atoms”) in practically all branches of science and engineering. The 
mass of material required for all such purposes is not very large, 
but since elaborate industrial facilities serve to produce only small 
amounts of radioactive isotopes this industry is rapidly expanding 
to meet the growing demands. 

The presence of a radioactive isotope in a substance is generally 
determined by means of a counter. In this manner, an activity of 
less than 107! millicurie can be measured. Thus, a quantity of 
radioactive phosphorus having a weight of less than 10-!© gm can 
be detected. 

Widely used isotopes include the following: Cobalt-60, which is 
B--radioactive with a half-life of 5.2 years; since it is a strong B-ra- 
diator it is widely used in gamma-raying and irradiation. Carbon-14, 
which has a half-life of 6,360 years; widely used in biochemistry, 
geochemistry, and in the study of the kinetics of chemical reactions. 
Phosphorus-32 and sulphur-35, which are B--radioactive with a half- 
life of 14.3 and 87 days, respectively; one of their most important 
uses is in agriculture in the study of fertiliser assimilation by plants. 
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Theoretical calculations indicate that atomic nuclei of almost 
all elements can serve, in principle, as sources of energy. It turns 
out that any nucleus heavier than the nucleus of a silver atom pos- 
sesses more energy than the components into which it may be divid- 
ed. All heavy nuclei release energy when they split up. The heavier 
the nucleus, the greater the magnitude of this energy. That is why 
uranium is the “best” nuclear fuel. 

However, light nuclei can also serve as sources of energy. Theoret- 
ical calculations indicate that a nucleus obtained by the fusion 
of two light nuclei will possess less energy than the original parti- 
cles. Therefore, energy is released upon fusion of light nuclei. Here, 
too, the further away from the mid-point an element stands in the 
Mendeleyev periodic table, the greater the amount of energy 
released in such a reaclion. The greatest amount of energy is obtained 
from the fusion of nuclei of hydrogen atoms. 

What conditions are required to achieve fusion between light par- 
ticles? Nuclear bombardment cannot yield the desired result since 
a charged particle is decelerated rapidly in a substance. The only 
way to achieve fusion is to raise the temperature. It is easy to calcu- 
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late the temperatures that atomic nuclei require to enable them lo 

approach each other closely, i.e., to overcome electric repulsion. 
Calculations indicate that the temperature at the centre of the 

Sun is 20 million degrees. Knowing that the energy associated with 


; 1 ym ; ; 
each degree of freedom is equal tozkI , one can determine the average 


kinetic energy of a particle at this fantastically high temperature. 
This energy is equal to merely 3,000 ev. Now, let us calculate from 
the formula for the potential energy of electrical interaction, U = £ : 
how closely two protons will approach each other. It turns out that 
the distance will be equal to 5 x 10-14 cm. As we know, the radius 
of a nucleus is considerably less than this value. Nevertheless, ther- 
monuclear reactions, i.e., reactions occurring at a high temperature, 
are possible in the Sun. Calculations which take into account the 
tunnel effect, and the fact that in every gas, including a gas of 
nuclear particles, there are particles the velocities of which consider- 
ably exceed the average, indicate thatin a year one atom ina million 
takes part in nuclear fusion. This small fraction is sufficient to 
account for solar activity. j 
Under terrestrial conditions, such high temperatures have been 
repeatedly created in the United States and the Sovict Union during 
hydrogen bomb tests. Temperatures of scores of millions of degrees 
are created during uranium bomb explosions. If a substance whose 
nuclei are capable of combining and releasing energy is located 
within the zone of this explosion, a thermonuclear reaction, the 
energy of which is many times greater than that of a uranium bomb, 
will occur. The uranium bomb in this case serves to trigger the 
thermonuclear reaction. a 


Here are examples of the most easily realised reactions which 
release large quantities of energy: ; 


D?+H!=He%; Li?-+H!=2He'; 
H?+H!—Het; C¥4+HI=N¥, 


Thermonuclear reactions can occur only at temperatures which 
give nuclei a thermal velocity sufficient to overcome, with appre- 
ciable probability, the Coulomb potential barrier. 

Of greatest interest are the reactions in deuterium and in mixtures 
of deuterium and tritium since these require the least amount of 
energy. A temperature of 200,000° is required to obtain one neutron 
per second in a gram of deuterium (in accordance with the reaction 
4D? + ,D? = .He® + on’). In a highly rarefied gas, an even higher 
temperature (of the order of 500,000°) is required for the same pur- 
pose. At such a temperature, deuterium (like other substances) will . 
form a plasma of nuclei and electrons. To transform deuterium to 
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this slale requires very little energy—of the order of several kilo- 
watt-hours. However, the difficulty does not lie in transmitting 
this energy to deuterium, but rather in providing thermal confine- 
ment, i.e., the deulerons must preserve the corresponding kinetic 
energy for a long period of time. 

A. possible solution to this problem was proposed by Sakharov 
and Tamm. They suggested that thermal confinement be achieved 
by utilising the electrodynamic forces which constrict parallel cur- 
rents. When a voltage is applied across a plasma, it is constricted, 
i.e., a plasma column is formed (pinch effect). This constriction 
reaches its culmination when the temperature of the plasma increases 
to a million degrees. At such instants, deuterium nuclei initiate 
a thermonuclear reaction. This theory has been confirmed by Soviet 
scientists and the method may prove to be the most practical means 
of utilising the energy of thermonuclear reactions. Experiments 
have been conducted with electric discharges in tubes with discharge 
gaps of up to 2 metres, and currents of 100,000 to 2,000,000 amperes 
have been investigated. 


CHAPTER XXXII 
ATOMIC STRUCTURE, OF BODIES 


227. Polycrystalline Substances and Monocrystals 


As a rule, crystals begin to grow around a very large number of 
centres in a melt or solution. If special measures are not adopted, 
a polycrystalline substance rather than a monocrystal will form as 
the result of crystallisation. Under a microscope such a substance 
seems to consist of individual grains (see Fig. 247). Each grain is 
a crystal which has an_ irregular 
haphazard form due to the fact that 
as its normal growth has been impeded 
by neighbouring crystals. Most bod- 
ies commonly encountered, partic- 
ularly metals and rocks, are poly- 
crystalline substances. 

The boundary between grains is 
revealed by etching with an ap- 
propriate solvent. Thisis due to the 
fact that most of the impurities of 
a substance accumulate al the grain 
boundaries. The interlayer between 
crystals differs from the “body” of 
the grain not only in that it contains 
foreign atoms, but in that ils atoms 

9°47 have a_ disturbed (transitional) 
; arrangement. The basic structure of 

the boundary between grains is 
clearly visible under a microscope as peculiar, smooth “paths”. The 
usual size of grains in metals and rocks is 10-4-10-> cm. 

A single crystal (monocrystal) of any crystalline substance may 
be found in nature or artificially grown. A monocrystal is distin- 
guished by its regular shape, i.e., plane faces, straight edges, and 
symmetry, in other words, the proportionality of ils component 
parts. This regular shape reflects a crystal’s internal properties, 
which enable us experimentally to distingush a crystal from a bit of 
material given such a shape artificially. It is also not difficult to 
recognise a crystal when its characteristic features are hidden. Thus, 
a sphere may be fashioned from a large crystal of rock salt but, 
when it is placed in water, surface material is dissolved at a non- 
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uniform rate and, as this process goes on, its accidental shape tends 
to be transformed into the polyhedral shape which is natural for 
this substance. A monocrystal can be easily distinguished from 
a polycrystal by means of X-ray analysis. 

A naturally formed crystal has the shape of a polyhedron. As in 
the case of every polyhedron, a crystal has a certain number of faces 
(p), edges (r) and corners (e), 
which are related to one another 
as follows: p +e =r-+ 2. For 
example, a cube has 6 faces, 8 
corners and 12 edges. 

Crystal faces are arranged in 
bands or zones. A system of faces 
the intersections of which are 
parallel edges is called a zone, 
and the direction of these edges 
is called the axis of the zone. 

Crystals of one and the same 
substance may differ considerably 
with respect to shape, but it has 
been long known that a given sub- 
stance has characteristic angles 
between faces and edges. (Depend- 
ing on chance, one component 
of a crystal may grow more than 
another; as a result, apparently, 
the proportionality between com- 
ponents may be upset.) This im- 
portant rule, which may be called the law of constant angles, is 
illustrated in Fig. 248. In the figure, we see four different crystals 
of silicon dioxide. It is seen that the number of faces and their relative 
dimensions differ from specimen to specimen, but the angles between 
corresponding faces and edges remain unchanged. 


228. Space Lattice 


The distribution of matter in a crystal may be represented by 
a three-dimensional periodic function. This is the fundamental rule 
lying at the basis of crystal investigations. 

Fig. 249 shows a wall-paper pattern. A certain element of this 
pattern is repeated in two directions. Consider any point A in the 
figure. A system of lines may be drawn through the selected points 
(nodes) as shown. A pattern element the repetition of which yields 
the full pattern is enclosed within a cell of the resulting grid. Evi- 
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dently, the entire pattern can be obtained from a single cell by 
means of parallel translations of the cell vectors @ and b. 

A crystal constitutes a space lattice, not a plane lattice. An ele- 
ment of a crystal is a parallelepiped based on three translational 
vectors a, 6, c, which may be selected, generally speaking, in an 
infinite number of ways. Such a 
parallelepiped will be called an 
elementary or unit cell, the vectors 
a, b, e the basic translational 
vectors, or simply vectors, and 
their lengths a, 6, ¢ the basic rep- 
elition periods or lattice spac- 
ings. The lattice is described in 
a system of coordinates the axes 
of which coincide with the direc- 
tions of the basic vectors. Differ- 
ent ways of selecting basic vec- 
tors, i.e., an elementary cell, are 
illustrated for a two-dimensional 
case in Fig. 250. An elementary 
cell in the general case is an ob- 
lique-angled parallelepiped with 


edges a, b, cand angles « = b,c, 


Bp =c,a, py =a, b. The six quan- 
tities which uniquely describe 
an elementary cell are called its 
parameters. Since the entire lat- 
tice is determined when an elemen- 
tary cell is given, the above 
quantities are sometimes called 
the parameters of the lattice. 

A cell in the form of an oblique-angled parallelepiped is said to 
be triclinic and if @ = y = 90°, monoclinic. A cell in the form of 
a right-angular parallelepiped is said to be rhombic and if in addi- 
tion a = b, tetrahedral. Ifa = bAc,a =f = 90°, and y = 120°, 
a cell is said to be hexagonal. The simplest cells have the form 
of a cube. 

If one of the lattice points is selected as the origin of the coordi- 
nate system, the radius vector of any other lattice point is given 
by the formula 


Fig. 249 f 


Rinnp = ma-+nb + pe, 


where m, n, p are whole numbers representing the coordinates of 
these nodes. The indicated numbers are called the indexes of the 
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nodes. The set of three indexes describing lattice points is designated 
by the nodal symbol [[mnp]]. 

There are an infinite number of nodal lines and nodal planes. 
Nodal lines and nodal planes are represented in a lattice by infinite 
families of parallels. The transition from one line to another of the 
same family, or from one plane to another, occurs by translation 
along a vector joining two nodes of these lines or planes. Each family 
of nodal lines is described by the lattice spacing along a nodal line 
and the direction, i.e., incline, to the selected coordinate axes. 


Fig. 250 


To describe a family, we select the line passing through the origin 
of the coordinate system. A nodal line is described uniquely by the 
indexes u, v, w, of the first lattice point lying on this line. The indexes 
of this lattice point are called the indexes of the line and are desig- 
nated by [uvw]. If an index is negative, a minus sign is placed above 
the numeral. The symbol [100] represents the a-axis of the lattice, 
[010] the b-axis, and [0041] the c-axis. The lines [011] and [011] 
represent plane diagonals in the face be. Of course, [014] and [014] 
are one and the same line. Distinguishing between these two designa- 
tions has significance only if we wish to emphasise the polarity of 
the direction. The spatial diagonals of a cell have the symbols [111], 
1111], [4414] and [111]. There are four of them, corresponding to the 
existence of eight quadrants; the other four symbols represent the 
same lines, but with reverse ‘polarity. Thus, [144] is anti-parallel 
to [111], éte. 

A space lattice can be constructed as follows. First, an infinite 
plane-lattice (nodal plane) is formed by means of two translational 
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vectors; then, a space lattice is formed by means of a third transla- 
tional vector which does not lie in this plane. A crystal lattice can 
be represented by families of nodal planes in an infinite number of 
ways. Every family of nodal planes consists of parallel planes sepa- 
rated from one another by equal distances. For a given lattice. 
specification of the interplanar distance and the orientation of one 
of the planes relative to the selected coordinate axes completely 
describes a family-of nodal planes. It is also sufficient to give the 
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Fig. 254 


orientation relative to the selected axes of the plane closest to the 
origin. The distance of this plane from the origin will be equal to 
the interplanar distance of the given family. 


Let this plane intersect the lattice axes at the coordinates -. ; 4 
L : 


Cc " A . é 
and 7 ie., fractions of the basic lattice spacings. The numbers h, 


k and 1, which describe the orientation of the plane, will be called 
the indexes of the plane. It is easily seen that h, k and I are whole 
numbers. One way of showing this is as follows. Consider a plane 
passing through an initial lattice point and another plane, of the 
same family, displaced by an amount a. This is shown in Fig. 254. 
Other planes will pass through these nodal planes, but they must ‘be 
separated from one another by equal distances. Therefore, the repe- 
tition periods along the selected axes will be divided by the nodal 
planes into a number of equal parts. 
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a 
"k 
and + of the lattice spacings is described by the set of three indexes 


The plane closest to the origin and intersecting the axes at + 


h, k and l. Its symbol is designated by enclosing these indexes be- 
tween round brackets: (hkl). For example, the plane (236) intersects 


the axes at the coordinates = : as and = . Any plane which intersects 


the axes at coordinates which are a multiple of these values is a 
member of this family. Thus, in the case under consideration, the 
successive planes beyond the 
one closest to the origin will 
intersect the axes at the fol- 
: : ; 2 c 
lowing coordinates: a, =), ae 
us a, b, z , etc. 

If a plane intersects the axes 
at negative coordinates, this 
is indicated by a minus sign 
above the corresponding index. 
It is evident that the planes 
(hkl) and (hkl) belong to the 
same family. Therefore, all 
the signs of the indexes of a 
plane may be reversed. 

Ifa plane is parallel to aco- @ 
ordinate axis, the correspond- 
ing index is equal to zero. Thus, 
(110) is a plane that is parallel 
to the c-axis, (001) is the lattice plane ab (see Fig. 252). (010) is the plane 
ac, and (100) is the plane be. Planes passing tlirough one of the axes 
and one of the diagonals have indexes consisting of two ones and 
one zero. For example, the plane (101) is a plane which is parallel 
to the b-axis and passes through the diagonal extending from the 
terminal of vector +a to that of vector +e (not the diagonal passing 


through the origin). The plane (101), which passes through the 
terminals of the vectors —a@ and —e, belongs to the same family. 


The plane (101) and its “reverse side” (104) are also parallel to the 
b-axis and pass through the diagonals ac which do not begin from 
the zero lattice point, but extend from the terminal of vector +a 
to that of vector —e and the terminal of vector —a@ to that of vector 
-+e, respectively. 

A symbol consisting of three units refers to planes passing through 
three diagonals. These planes pass through the terminals of all 
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three lattice vectors. Thus, the plane (111) passes through the ter- 
minals of the vectors —a, —b and +e. 

The indexes of a family of nodal planes are at the same time Lhe 
indexes of crystal faces. Two parallel faces have the indexes (hi!) 


and (hkl). 
229. Cell Selection and Crystal Symmetry 


The vectors a, b and ¢ of a lattice may be selected in a variety of 
ways. If there are no lattice points within an elementary cell, we 
call it a primitive cell. 

Various ways of selecting a primitive elementary cell are shown 
in Fig. 253. In view of the periodicity of a space lattice, the magni- 
tude of the volume associated with each lattice point is a constant. 


Fig. 253 


Its value is equal to the volume of a primitive elementary cell, re- 
gardless of the manner in which such a cell is selected. Since each of 
the eight lattice points at the corners of such a cell is “shared” by 


eight cells, _ of the lattice points bélongs to the given cell. Thus, 


on the average, there is one lattice point per cell. 

In a number of cases, it is expedient to select an elementary cell 
so that its volume is greater than that of a primitive cell. Thus, 
in order to take maximum advantage of the symmetry of a crystal, 
we often select an elementary cell with an additional lattice point 
at face centres or at the centre of the cell. Three cases are encoun-| 
tered frequently: 

1) Body-centred cell. An additional lattice point is located at the 
intersection of the spatial diagonals of the cell. In this case, there 


are two lattice points per cell: [{000]] and (oss . The lattice 
point at the centre of a cell belongs entirely to the given cell. The 
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eight lattice points at the corners are shared jointly by eight cells, 


i.e., x of each of these lattice points belongs to the given cell. 


2) Face-centred cell. An additional lattice point is located at the 
centres of a pair of faces, e.g., ab. In this case, too, there are two 


lattice points per cell: [[000]] and [las 0] | ; 


3) All-sided face-centred cell. An additional lattice point is locat- 
ed at all face centres. In this case, there are four lattice points per 


cell: {{000N, [[0 sal [[= 0 =| and [les 0|]. 


The following designations are commonly used: P—primitive 
cell; A, B and C—face-centred cells with a lattice point in faces 
be, ac and ab, respectively; F—all-sided face-centred cell, and J— 
body-centred cell. 

As emphasised earlier, a lattice point is an arbitrary point, but 
for convenience is selected in a specific manner. The succeeding 
lattice point is separated from the selected one by a distance equal 
to the lattice spacing. Thus, there is one lattice point per primitive 
cell. 

All the atoms of a primitive cell may be replaced by lattice points. 
Usually, a point of intersection of symmetry elements is taken as 
a lattice point. 

A primitive cell may consist of many aloms. When there are 
many atoms in a cell, the structure is described by the coordinates 
of the atoms in an elementary cell. 

The model of a crystal as a space lattice is in full accord with 
experimental data. Crystal edges and faces correspond to nodal 
straight lines and planes. The angles between crystal faces and 
edges are the same for all crystalline objects of a given chemical 
compound, 

The symmetrical features of crystal structure: may also be deter- 
mined from the space lattice model. 

Crystals of different substances have different symmetry. If a 
crystal is well formed, its symmetry is self-evident. It is clear that 
planes and axes of symmetry may be passed through a crystal in 
a specific manner. 

The external symmetry of a crystal can be explained by its inter- 
nal structure, i.e., the symmetry of the space lattice. 

In addition to axes of symmetry, symmetry elements encountered 
in crystals include mirror planes and inversion or symmetry centres. 
Fig. 254 illustrates the operations which may be performed by 
means of these symmetry elements. 

It has been long known that axes of symmetry of fifth order and 
axes of symmetry of higher order than the sixth do not occur in 
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crystals.* Basing himself on this fact, A. V. Gadolin proved that 
there can be only 32 groups of symmetry among crystals. 

Before the development of the space lattice theory, it was unclear 
why axes of fifth, seventh, etc., orders were not encountered in 
cryslals. This and other features 
of crystal symmetry can be ex- 
plained by the space lattice the- 
ory. 

Let us consider rotation of a 
lattice plane. Rotations which 
are not possible for a plane will 
certainly not be possible for the 
entire lattice. 

Fig. 254 Assume that an axis of n-th or- 

‘der passes, through the lattice 

point B and that the identical one closest to it passes through the lattice 
point A (see Fig. 255). Rotation about axis B transfers lattice point 
A to A’ and rotation about axis A transfers lattice point B to B’. 
It is seen from the diagram that B’A’ = AB (1+ 2 cosa). But 


Fig. 255 


the distance A’B” must be a multiple of the lattice spacing AB, 
for A’B’ is parallel to AB. Therefore, 2 cosa must be equal to 
a whole number. It follows that cos a can assume only the values 0, 


+ + and +1, and @ the values 60°, 90°, 120°, 180° and 360°. This 


also follows from the definition of a closed symmetric operation: 
the rotation angles are equal to 360° divided by a whole number. 
Thus, it is possible to have rotational axes of symmetry of sixth, 
fourth, third, second and first orders in a crystal. 


* If a body is turned about a certain axis by an angle on so that the 


figure obtained coincides with the original figure, such an axis is called an 
axis of symmetry of n-th order. For example, a 3-faced prism based on an isos- 
celes triangle has an axis of symmetry of third order which passes through the 
centre of the triangle and is parallel to the edges of the prism. 
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One can prove easily that an axis of symmetry is a nodal line 
which is normal to a nodal plane. 

The symmetry of a crystal is determined by the symmetry of the 
space lattice. But it should be realised that the symmetry of a lat- 
tice is considerably richer. Only 32 groups of crystal symmetry exist 
but, as was first shown by E. S. Fyodorov, the founder of structural 
crystallography, a space lattice has 230 types of symmetry (Fyodo- 
rov groups). ; 

The richer symmetry of a lattice is due to the fact that in addition 
to closed symmetric operations it includes a symmetry element 


Fig. 256 


which is not possible for a body, i.e., translation. A symmetric 

operation consists in the displacement of a body to a position which 

is indistinguishable from its original position. Therefore, in the case 

of an infinite lattice, a displacement along one or another nodal 
line is a symmetric operation. 

Translation introduces the following new symmetry elements: 

1) a combination of rotation and translation—screw axes; and 2) a 

~ combination of reflection and translation—slip planes. A screw axis 

of fourth order is shown in Fig. 256 (right diagram). Each of points 

2, 3, 4 and 5 is obtained from the preceding one by a 90° turn ané 


displacement along the axis by + of a period (¢). In Fig. 256 (left 
diagram) we see triangles reflected in a plane Q and slipped along 
AA’ by + of a period. 


Re «nna 
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230. The Packing of Particles in a Crystal 


Figures having a definite volume and shape can be stacked or 
packed. It is by no means clear to what extent a formation of atoms 
will conform to this picture. Here, the answer to the following ques- 
tion is of prime importance: If we attribute a definite shape to an 
atom or group of atoms, will this even roughly correspond to a mini- 
mum on the potential curve of particle interaction in a given direc- 
tion? Moreover, to what extent will the volume attributed to the 
atom or group of atoms encompass all the electrons, including the 


Diketopiperazine 
Fig. 257 


valence electrons, belonging to this atom or group of atoms? If it 
turns out that describing the limits of an atom or molecule by a 
definite contour has physical meaning, we will have ascertained at 
the same time how this shape is manifested when a crystal is formed 
from particles. 

The nature of interactions between atoms in a crystal is infinitely 
varied. However, several limiting cases can be singled out: pure 
ionic bonds, homopolar bonds, metallic bonds and molecular bonds. 
As examples, let us consider the structures of rock salt, zinc sulphide, 
iron and the organic substance diketopiperazine (see Fig. 257). 
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In the first three cases, the absence of a tight group of atoms is 
characteristic. A molecule cannot be distinguished in a crystal of 
rock salt. léach atom of sodium has six perfectly equal chlorine 
atoms as neighbours. Nor can a molecule be distinguished in the 
other two compounds, which are examples of homopolar and metallic 
bonds. ; 

In ionic crystals, the particles consist of positive and negative 
ions, which allract one another in accordance with the laws of elec- 
trostatics. Everything said about ionic bonds in molecules applies 
to crystals. The nature of ionic structures is communicated reason- 
ably well if the ions are represented by spheres having definite 
ionic radii. The values of ionic radii in ionic molecules differ little 
from those in crystals. 

In crystals with a homopolar bond, every two atoms are linked 
by a pair of electrons. In this manner each zinc atom is linked with 
its surrounding sulphur atoms, and each sulphur atom with its sur- 
rounding zinc atoms. If a sign of a homopolar bond were considered 
to be an indication of a molecule, it could be said that an entire 
crystal constituted a single molecule. It is physically meaningless 
to consider homopolar crystals as constructed of contiguous spheres. 
The dimensions of free atoms of sulphur and zinc are considerably 
greater than the distance between them in zinc sulphide. A homo- 
polar bond brings the atoms into close contact and makes regions 
in which electrons of these atoms are located common regions. If 
the structure of zinc sulphide consisted of contiguous spheres, 
a large portion of the electron cloud would be located outside the 
spheres, i.e., only 25 per cent of the volume would be occupied by 
spheres. 

Metallic bonds will be discussed in Chapter XX XVII. Here, a few 
words on this subject will suffice. In metals and alloys, outer elec- 
trons are common, forming an electron “gas”. The lattice of a metal- 
lic compound consists of atomic residues (positive ions) “cemented” 
by electrons. Here, too, it is physically meaningless to represent the 
structure as contiguous spheres, in spite of the fact that formally 
the structures of certain metals can be represented as closely packed 
spheres. 

In crystals of the same type as diketopiperazine the molecules are 
distinct. They can be easily recognised since the intermolecular 
distances are considerably greater than the intramolecular distances. 
By studying the arrangement of molecules in crystals, crystallo- 
graphers have been able to pick out intermolecular radii of portions 
of spherical surfaces describing the limits of a molecule. The model 
of a molecule formed of microscopic spheres of intermolecular radius 
is based on an analysis of crystal structures. Such a geometric repre- 
sentation of crystals formed of molecules is quite justified since 


38* 


596 Atomic Structure of Bodies 


most of a molecule’s electron cloud is contained within the contour 
of the molecule. 

It must be concluded that the representation of the component 
particles of a crystal as geometric figures is meaningful in two cases: 
in ionic and in molecular crystals. On the other hand, such a rep- 
resentation is meaningless in the case of homopolar crystals and 
metallic compounds. 

Now, the following question arises: How is the shape of ions and 
molecules manifested in the formation of a crystal? The answer is 
that it is manifested in the compact packing of particles. Experi- 
ments indicate that molecules are always packed in such a way that 
a “projection” of one molecule fits into a “depression” of another. 
There is a clear tendency for the molecules to become so oriented 
with respect to one another that the volume of an elementary cell 
is as small as possible. The situation is similar in the case of ionic 
crystals. The stacking of spheres occurs in such a manner that the 
large spheres fit closely together, while the small spheres (ions) fit 
into the empty spaces of the basic structure. 

In representing ions by spheres, and molecules by spatial figures, 
we find that the “empty” space is equal to 25-35 per cent of the total. 

Close packing in molecular and ionic crystals provides basic 


proof that shape and volume are meaningful attributes of atoms 
and molecules. 


231. Molecular Crystals 


; The assertion that molecules are bound by intermolecular forces 
in a molecular crystal is, of course, pure tautology. What then can 


ri mai about this concept and about the nature of intermolecular 
orces? 


The forces of attraction acting between molecules are interaction 
forces of electrical origin. A molecule possesses electrical properties 
even though it is electrically neutral, i.e., its total charge is equal 
to zero. 

In order to take these properties into account, all we need do is 
consider a molecule to be a dipole. As we know (see p. 252), dipoles 
attract each other. If-all the molecules of a substance are polar, 
d.e., have a constant dipole moment, inherent dipole interaction 
exists between them. If a portion of the molecules are polar (the 
substance consists of several kinds of molecules), such molecules 
may induce dipole moments in neighbouring molecules. This is 
called induced interaction. 

Finally, it may be said that dipole interaction always exists be- 
tween momentary dipoles. (Every molecule is a momentary dipole 
at. every instant of time, since the centre of gravity of electrons 
never coincides with that of nuclei.) 
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It is not surprising that the potential energy of interaction is 
represented by a very complex function. The calculation of the 
dependence of the potential energy of interaction on the mutual 
orientation and spacing of molecules in a crystal is hardly possible. 
The potential of the attractive forces can be calculated only when 
the distances between molecules are large. When these distances 
are much greater than the dimensions of the molecule, the forces 
of interaction are inversely proportional to 7’. This is not the case 
in crystals and liquids. 

The enumerated forces of attraction are counterbalanced by the 
forces of repulsion between the electron clouds of neighbouring 
molecules. The Pauli exclusion principle does not permit the inter- 
penetration of clouds, so the forces of attraction bring the molecules 
into “contiguity”. 

Although the binding energies of different molecular crystals may 
differ considerably, intermolecular distances in no way depend on 
molecular polarity, binding energy, etc. Evidently, the separation 
between molecules is determined mainly by the well-defined bound- 
aries enclosing the electron cloud of a molecule. 

The shape of a molecule determines the nature of the molecular 
packing as well as the intermolecular distances. Molecular polarity 
and other characteristics of the forces of attraction not,only do not 
affect the intermolecular distances, but also do not disturb the 
tendency to compact packing of molecules. Thus, in practically all 
cases, minimum energy is achieved by compact packing of, mole- 
cules. Since molecular crystals strictly obey the principle of compact 
packing, only an exceedingly small variety of structural types and 
symmetry are encountered among such crystals. 

It is convenient to view the packing of molecules in crystals as 
a tight packing of compact layers. Two types of such layers are 
encountered. These are illustrated in Fig. 258. The one with right- 
angled cells has greater symmetry. In this case (more than 90 per 
cent of organic crystals are formed of such layers), the molecules 
are stacked in the characteristic zig-zag manner shown in the figure. 
The rows of molecules forming a layer are connected by a screw 
axis of the second order (2;). This means that one row of molecules 
can be transformed into an adjacent one by a 180° turn and a displace- 
ment of half a period along the axis. 

In compact layers, each molecule has six close neighbours. When 
the layers are stacked, a molecule usually obtains six additional 
close neighbours—3 above and 3 below. Thus, the total number 
of such neighbours becomes equal to 412. 

Crystals having a high order of symmetry are rarely encountered 
in the world of molecular crystals. It is not possible to pack com- 
pactly unsymmetrical molecules in symmetrical crystals. 
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If a molecule possesses symmetry, this does not mean that the 
crystal also has such symmetry. A molecule of naphthalene has a 
high order of symmetry; three mutually perpendicular planes of 
mirror symmetry may be drawn through it (see Fig. 259). If these 


symmetry elements were preserved in a formation of molecules, the 
packing would be insufficiently compact. Therefore, the symmetry 
elements of a molecule which prevent greater compactness are “lost” 

in the formation of a crystal. 


/ 
ms The preservation of a centre 
-y¢- é of inversion is possible without 
; Nie Ae sacrificing compactness of mo- 
aa lecular packing. A crystal 
formed of molecules possessing 


this symmetry element usually 
does not preserve other sym- 
metry elements of the mole- 
cules, but does preserve a 
centre of inversion. 

In other cases as well, the outcome of the tendency to symmetry 
7 oppesee to the tendency to compactness can be reliably pre- 

icted. 

An example of a packing arrangement typical of molecular crys- 
tals is provided by diketopiperazine (see Fig. 257). The molecules 
have a high order of symmetry, but the crystal preserves only a cent- 
re of inversion. A molecule, of course, does not cease to be highly 
symmetrical simply because a crystal of such molecules does not 
possess its symmetry elements. 


Fig. 259 
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232. Compact Packing of Spheres 


A very important class of ionic crystals may be represented by 
a compact packing of spheres. 

Most anions are larger than cations. In such cases, crystals con- 
stitute a compact packing of anion spheres between which cations 
are located. This is how silicates, one of the largest groups of natu- 
ral inorganic substances, are formed. In silicates, the cations are 
located in the empty spaces of a 
compact structure of oxygen anions. 

Let us examine the laws of com- 
pact packing of spheres, i.e., the 
fundamental structures of a great 
number of crystals. The only pos- 
sible arrangement of acompact layer 
of spheres is shown in Fig. 260. 
Each sphere has six neighbours. To 
form a compact packing arrange- 
ment, one must place the spheres 
of a second layer in the spaces of 
the layer below it. It is not possible 
to fill all the spaces with spheres 
of the same size: every second 
space in the figure is filled (crosses 
denote the spaces of the first layer which are filled by spheres of the 
second, and dots denote the spaces which remain vacant). 

There is also only one possible arrangement for compact packing 
of two layers of spheres. For three layers, however, the situation is 
different. In order to achieve compact packing in this case, one 
must place the spheres of the third layer in the spaces of the second, 
but this can be done in two ways: the centres of the spheres of the 
third layer may be placed either above the centres of the spheres 
of the first layer or above the spaces denoted by dots. Both three- 
layer structures are packed equally compactly; however, they differ 
significantly from one another. When a fourth layer is added to the 
structure, the number of possible packing arrangements becomes 
even greater: from the two three-layer structures, one can make four 
four-layer structures. In the case of a five-layer structure, there are 
five possible arrangements, etc. It is evident that the number of dif- 
ferent arrangements of spheres packed equally compactly increases 
very rapidly with increasing number of layers. 

Now, let us compare a crystal lattice with such an arrangement 
of spheres. A crystal may be represented as a structure of spherical 
‘atoms in which the arrangement of layers is repeated exactly after 
a certain number of layers. If such a sequence begins with the four- 


Fig. 260 
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teenth layer, for example, this means that a cell is thirteen layers 
high. In such a case, the fourteenth layer is above the first, the 
fifteenth above the second, the sixteenth above the third, etc. 
The simplest packing arrangement consists of two layers: the 
third layer lies above the first, the fourth layer above the second, 
etc. (see Fig. 261, right). This is a hexagonal compact packing 


Fig. 261 


arrangement. A cell of such a crystal is shown in the lower right- 
hand corner of the figure. The dots and crosses denote the locations 
of the centres of the spheres. 

Three-layer crystals, in which the fourth layer is a repetition of 
the first, the fifth of the second, etc., are very common (see Fig. 261, 
left). In the lower left-hand corner of the figure, where only the 
centres of atoms are indicated, we see that a cubic elementary cell, 
centred in all faces, may be selected. Here, the compact layers are 
arranged perpendicular to the spatial diagonal of a cube. Such a 
structure is called a cubic compact-packing arrangement. 

Empty spaces remain in a packing of spheres of equal size. It can 
be easily calculated that the volume of such spaces is equal to 
about !/, of the overall volume. There are two kinds of such empty 
spaces: one is surrounded by four spheres the centres of which are 
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located at the vertexes of a regular tetrahedron (see Fig. 262a); the 
other is surrounded by six spheres—the centres of these spheres 
form a regular octahedron (see Fig. 262b). The first kind is smaller 
in size, but there are twice as many of them as the second. 

It can be shown that in any compact arrangement of equal spheres 
there are two small empty spaces and one large one per sphere. 
Small spheres can fit into these spaces but if they are somewhat too 
large, they cause the large neighbouring spheres to move apart, 
loosening the compact packing 
arrangement. 

Since different packing arran- 
gements are possible with equal 
numbers of spheres, and small 
spheres may fill the empty spaces 
in different ways, ionic crystals 
have a great variety of struc- 
tures. 

In crystals of common salt, a 
compact three-layer structure is 
formed by large chlorine ions 
(light-spheres in Fig. 257), and so- 
dium ions(dark spheres) fill allthe large spaces; hence, every sodium 
atom is surrounded by six chlorine ions. In iron disulphide (pyrite), 
a compact two-layer structure is formed by large sulphur ions; iron 
ions fill all the large spaces. In a crystal of lithium oxide, the chemi- 
cal formula of which shows that there are two lithium atoms for 
every oxygen atom, the compact structure is formed by large oxygen 
ions. Since lithium ions fill all the small spaces, each lithium ion 
has four neighbours (oxygen ions). In a crystal of cadmium chloride, 
the chemical formula of which shows that there are two chlorine 
atoms for every cadmium atom, the compact structure is formed 
by large chlorine ions; cadmium ions fill large spaces but not all, 
i.e., they fill the large spaces of every third layer of chlorine ions. 
We have presented, of course, only the simplest “patterns” of the 
filling of empty spaces in compact packing arrangements. 


Fig. 262 
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The largest group of crystals consists of bodies formed of mole- 
cules. Ionic compounds also constitute a fairly large group. As 
indicated already, the representation of a crystal in these cases as 
closely packed particles is entirely justified. However, it is neces- 
sary to examine those structures in which the direction of the bonds 
between atoms, the deviation of the electron cloud from spherical 
symmetry, etc., are the cause of structural arrangements which 
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cannot be represented so simply. Such exceptions include structures 
of atoms bound by common electrons. 

Most metals have structures consisting of body-centred cubic 
cells. In such crystals, each atom has eight neighbours rather than 
twelve, as is the case in a compact packing of spheres. This is the 
case, for example, for atoms of iron (see Fig. 257). The lattice of 


Fig. 263 


iron is cubic; iron atoms are located at the corners and centres of 
cubes. Lithium, potassium, caesium and a number of other sub- 
stances possess such a structure. 

In Fig. 263, the structure of crystalline mercury is compared with 
an ideally cubic compact packing arrangement. It can be seen that 
the nature of the arrangement of atom centres is the same in both 
cases, but in the former the distance between layers is less, and the 
distance between atoms in a layer is more than in the latter. This 
is analogous to a compact packing of slightly flattened spheres. 

Many examples exist of such more or less “damaged” compact 
packing structures. In the case of ice (see Fig. 264), all resemblance 
to a spherical packing arrangement is lost. The link between each 
pair of oxygen atoms is implemented by a hydrogen atom. In these 
four bonds, there are two oxygen atoms per hydrogen atom. (The 
structure shown in Fig. 264 does not, of course, contradict the chem- 
ical formula for water.) For purposes of clarity, a “hydrogen” bond 
is shown in the figure as a “neck”. The structure of ice is very loose, 
as indicated by the large “holes”. If one projects the structure above 
the plane of the figure, these holes are transformed into broad chan- 
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nels which pass through the structure. The structure of ice is an 
important exception to the general rule. This does not mean that 
the cases in which the likening of a crystal to a compact packingof 
particles is meaningless are rare. 
As indicated above, crystals 
formed of atoms bound by com- 
mon electrons cannot be likened 
to a compact packing of spheres. 
The structure of zinc sulphide, 
which is illustrated in Fig. 257, 
is quite typical. Moreover, sev- 
eral elements reveal a similar 
structure. These include carbon 
(diamond), silicon, germanium 
and tin (white). 
Homopolar bonds can form 
layers and chains of atoms. 
Fig. 265 shows the structure 
of graphite. The carbon atoms in 
graphite form a layered structure, Fig. 264 
but these layers are not the same 
as those of a compact packing arrangement. It is not possible to 
form a layer of graphite of contiguous spheres. In graphites, layers 
of strongly bound atoms constitute planes. Arsenic and phosphorus 


also form layered structures in this 
WW + YYW: ZY: UY LY” 
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sense, but the atoms of their layers 
are not arranged in a single plane. 
es Mi Myc 
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An example of a structure consist- 


ing of chains of strongly bound 
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stance is strongly bound to only 
two neighbours. In gray selenium, 
the atoms form an endless spiral 
about a straight line. The separa- 
tion between atoms of neighbour- 
ing spirals is considerably greater 
than the separation between close 
atoms in a given spiral. 

The black, lustreless, soft graph- 
ite used in pencils and the shiny, 
transparent, hard diamond which can cut glass are composed of the 
same kind of atoms, i.e., carbon. This is a very striking example 
of how greatly the properties of a crystal are affected by the arran- 
gement of its atoms. Refractory crucibles capable of withstanding 


atoms is provided by gray sele- 
nium. Each atom of this sub- 
9 UE VLA 
Fig. 265 
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temperatures of 2,000-3,000° C are made of graphite, while at temper- 
atures exceeding 700° diamonds are burned up; graphite has a spe- 
cific weight of 2.4 as compared with 3.5 for diamond; graphite 
conducts electric current, but diamond does not; etc. 

The ability to form different crystals is not a property of carbon 
alone. Almost every chemical element in the crystal state and 
every substance has several forms. Six forms of ice are known, nine 
of sulphur, and four of iron. 

At room temperature, atoms of iron form cubic lattices, the atoms 
being located at the corners and centres of cubes. Thus, each atom 
has eight neighbours. At high temperatures, iron atoms form a com- 
pact structure. Here, each atom has twelve neighbours. Iron possess- 
ing eight neighbours is soft, while iron possessing twelve is hard. 
The quenching of steel fixes, at room temperature, a compact cubic 
structure which is stable at higher temperatures. 

We have seen in the cases of carbon and iron that the structures 
of different crystals of one and the same substance may differ consid- 
erably from one another. The same holds true for other substances. 

Thus, for example, in a crystal, yellow sulphur forms corrugated 
rings of eight atoms each. Each ring constitutes a sulphur molecule. 
Red sulphur also consists of such rings but they are turned com- 
pletely differently with respect to one another. 

Yellow phosphorus atoms form cubic structures with eight close 
neighbours. Black phosphorus has a layered ‘structure similar to 
graphite. 

Gray tin has a structure similar to that of a diamond. Theorcti- 
cally, white tin could be obtained from gray tin by strongly com- 
pressing the diamond-like structure along the axis of a cube. As 
a result of this flattening process, a tin atom would have six close 
neighbours instead of four. 

Organic substances also frequently have a variety of crystal 
forms. The very same molecules are arranged differently with respect 
to one another. 


234. Thermal Vibrations in a Crystal 


From the viewpoint of energy, an ideal crystal is, in a way, the 
antithesis of an ideal gas. 

In an ideal gas, the interaction energy of particles is much less 
than the average energy of thermal motion kT. On the other hand, 
since strong coupling exists between particles in a crystal, the inter- 
action energy is much greater than 47. Therefore, thermal motion 
in crystals cannot disrupt the coupling between atoms, but merely 
results in small vibrations of the atoms about equilibrium positions. 
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In a crystal, every atom vibrates about an equilibrium position. 
For most crystals, the vibration amplitudes are of the order of 


0.1 A, i.e., a small fraction of the distance between close atoms, 


which, as we know, is of the order of 1.5-2 A. 

The nature of this vibration may be very complex. During a vi- 
bration period, an atom describes a complex curve about its equilib- 
rium position. This is due to the fact that an atom is bound to its 
neighbours by different forces; hence, its vibrations are of an ani- 
sotropic nature. In any case, it is always possible to resolve the 
vibrations of an atom along three axes. Evidently, the atoms of 
a crystal will have 3N degrees of freedom, where N is the number 
of atoms. 

If the molecules of a crystal are clearly distinguishable, it is 
meaningful to speak of vibrations of a molecule and vibrations of 
atoms within a molecule. Since the coupling between molecules is 
considerably weaker than between atoms, the frequencies of their 
vibrations will be less. In molecular crystals, the motion of amolecule 
as a whole is of decisive importance. The vibrations of a molecule 
about its equilibrium position are of a translational as well as tor- 
sional nature. Apparently, it is even possible in rare cases for total 
rotation of molecules about centre of gravity to occur. For example, 
such rotation of molecules probably occurs in the case of solid 
methane (CH,). 

The total energy of a vibrating particle consists of its potential 
energy and its kinetic energy. The average values of these two ener- 
gies over a period of vibration are equal to each other. As is known, 


the average kinetic energy of an atom in a gas is equal to < kT. It 


would seem reasonable to assume that, at the same temperature, 
a vibrating atom possessing twice the average energy possesses 3 kT 
units of thermal energy. Then, one mole of crystalline substance 
should have an energy 3RT and a molar heat capacity C) = 3R ~ 
~~ 6 cal/mole. 

At high temperatures, this formula is in very close agreement with 
experimental results. The temperature dependence of the heat capac- 
ity of crystalline bodies is shown in Fig. 266. Beginning at zero, 
the heat capacity increases, attains a value of 6 cal/mole at a certain 
temperature, and then remains unchanged. The ratio of tempera- 
ture to a constant @ (to be discussed in the following article) is plot- 
ted along the z-axis. 

At high temperatures, the value of kT is significantly greater than 
the difference between vibrational energy levels; hence, the quantum 
nature of distribution of vibrating atoms according to. energy levels 
does not affect the value of the average vibrational energy. Under 
such conditions, the simplified method of calculating the average 
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energy is quite justified. This is confirmed by exact calculations 
which take into account the distribution of atoms according to ener- 
gy as given by the Boltzmann law. 

When AT becomes comparable to the difference between energy 
levels, the Boltzmann law is no longer applicable and must be re- 
placed by a quantum distribution law (see p. 684). Calculations indi- 
cate that heat capacity decreases with decreasing temperature. We 
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Fig. 266 


shall not present these calculations; we note, however, that quali- 
tatively a decrease in C,, with decreasing temperature is quite under- 
standable. The smaller the magnitude of k7, the smaller the number 
of energy transitions that may occur in a system. This means that 
the possibility of thermal exchange decreases with decreasing kT 
and approaches zero as a limit. The limiting case can be explained 
as follows: Energy cannot be transmitted to a body in extremely 
small bursts #7 even if there are an infinite number of such “bursts” 
with a very high total energy. Energy cannot be transmitted since 
a single “burst” does not suffice to transfer a system from its zero- 
energy level to the next one. 

Experiments indicate that energy transitions corresponding to 
a change in the state of molecular motion in a crystal lie in the region 
of long infrared waves. 

For purposes of guidance, let us assume that such transitions cor- 
respond to a wavelength of 1 mm. 
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Let us compare several values of AT with the quantum energy cor- 
responding to a wavelength of 1 mm. For this wavelength, v = 
= 3 x 10!" sec, i.e., hv = 200 X 107)’ erg. 


Temperature (°K) OT a Ee 
500 7,000 x 10-17 
400 1,380 x 10-17 
10 138 x 10-17 
{ 14x 10-17 


It can be seen that at 100°K the thermal vibration energy still 
considerably exceeds the difference between the energy levels of 
a molecule in a crystal. At 10°K it has the same order of magnitude 
and at 1°K thermal vibrations are unable to produce transitions 
from -one level to another. 


235. Thermal Waves 


An interesting feature of thermal vibrations in a crystal is their 
occurrence in the form of thermal waves. Therefore, atomic vibra- 
tions cannot occur independently of one another. An atom deviating 
from its equilibrium position pulls along the next one. 

Since a crystal is a finite body, standing waves are formed within 
it. As in the case of all natural oscillations, the maximum length of 
a standing wave equals twice the dimension of the body. The bound- 
aries of a crystal must correspond to standing-wave nodes. 

On p. 135 we discussed elastic vibrations of solids considered as. 
a continuum. It was shown that there arise in a finite solid body 
numerous standing waves of different direction and frequency. The 
picture becomes considerably more complex when the atomic struc- 
ture of the solid is taken into account. A theoretical investigation of 
the possible vibrational motion of atoms in a monocrystal indicates 
that thermal motion in a crystal can be represented as the result 
of the superposition of 3sN waves, where N is the number of cells. 
and s the number of atoms per cell. The number of possible waves 
is equal to the number of degrees of freedom of the system of atoms. 
forming the crystal. How do these waves arise and how are they 
manifested? 

Let us restrict ourselves to a consideration of a chain of atoms, 
i.e., a“‘unidimensional crystal”. Fig. 267 shows‘such an atomic chain, 
the “cells” of which.consist of two atoms, denoted by black and white. 
dots. Since the actual thermal motion of atoms in a crystal is of 
a very complex nature, the figure has been simplified to show the 
“elementary” waves into which this motion can be resolved. Calcula- 
tions indicate that the resulting vibration can always be represented 
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as the sum of harmonic vibrations. Like in the case of a solid rod, 
a series of waves of various wavelengths arise in a unidimensional 
crystal. If a chain consists of a thousand cells with period a, there 


arise NV waves of wavelength 2,000 a, 1,000 a, zn a, 500 a, 400 a, 


3 
etc. The shortest wavelength is 2 a. 

But this is not all. Each of the possible wavelengths occurs in s 
variations. Two types of waves of the same wavelength are shown in 
the figure. A case exists in which the lattice of atoms vibrates as 
a whole. Such a wave is called an acoustical wave. The remaining 
s — 1 waves are quite different. In 
these cases, different types of atoms 
execute complex motion with respect 
to one another and at each instant 
only atoms of a single type fall on 
the sinusoid. There are s — 1 such 
vibrations, which are called opti- 
cal vibrations. 

The figure shows waves corre- 
sponding to atomic vibrations in 
a single direction. 

Atomic vibrations can always 
be resolved into two transverse 
components and one longitudinal 
component. Therefore, a wave of 
given wavelength travelling in a given direction will have 3 com po- 
nents of the acoustical type and 3 (s — 1) of the optical type. Of the 
total of 3sN waves, 3N—two transverse and one longitudinal for 
each wavelength in each direction—will be acoustical. This is com- 
pletely valid for three-dimensional crystals. 

Although the wavelengths and frequencies of the waves have dis- 
crete values, we may utilise the results obtained on p. 136 and 
express, approximately, the number of acoustical vibrations having 
frequencies less than v as 


Fig. 267 


40 3 

Ber Ys 
Here, v is the volume of the crystal and c the velocity of the wave. 
The velocities of the longitudinal and transverse waves are differ- 


ent. Therefore, the total number of waves, which is equal to 3/, 
should be written as follows: 


4nv { 1 2\i5  _ 
7 ata) Vinax = 3, 


where c; is the longitudinal velocity and c; the transverse velocity 
of the wave. Whence, the value of the maximum frequency of vibra- 
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tion, Vmax, can be easily found. The corresponding wavelength, 


cy ct 


Amin =a ‘ ’ 
Vmax Vmax 


is, as it should be, of the same order of magnitude as a cell period. 

If the velocity of propagation of acoustical waves in a crystal is 
known, we can calculate V;qx, the value of which determines to a 
large extent the behaviour of a crystal. 

As was indicated above, heat capacity depends on the commensur- 
ability of the energy kv and the thermal energy AT. If hvpax « AT, 
thermal exchange excites all of the vibrations and waves in a crys- 
tal, i.e., all of the quantum transitions are possible. As a result, 
the quantum nature of thermal exchange is not apparent. Such 


hVmax 


a crystal has a characteristic temperature 0 = which is much 


less than the temperature of the experiment, i.e., 0 < 7. On the oth- 
er hand, if kvmax > KT, i-e., if the characteristic temperature 0 > T, 
only vibrations of low frequency are excited in the crystal because 
high energy levels cannot be surmounted by the thermal “bursts”. 

Here are examples of the characteristic temperatures (°K) of 
a number of crystals: 


Pb Benzene Ag NaCl Fe Be Diamond 
90 150 215 280 450 1,000 1,860 


For such substances as lead and benzene, room temperature is “high”. 
This corresponds to the horizontal portion of the heat capacity curve 
(C, = 6 cal; see Fig. 266). On the other hand, room temperature is 
low for beryllium and diamonds. Thermal vibrations of these sub- 
stances are excited to an insignificant extent and their heat capacity 
is considerably less than 6 cal/mole. The maximum vibration fre- 


quencies ae =" can be calculated from the above values of the 


characteristic temperature 0: 


Pb Benzene Ag NaCl Fe Be Diamond 
4.88x 1012 3.13x10!2 4.47x1012 5.84 1012 9.31012 20.8x 1012 38.8 1012 


It can be seen that the limiting frequencies of thermal vibrations lie, 
as was assumed in the example on p. 607, at the boundary between 
the infrared and radio bands (Amin > 107? cm). 


236. Thermal Expansion 


How can we explain the fact that the average distance between 
neighbouring atoms increases with increasing temperature? To 
answer this question, let us consider the curve of potential energy of 


39—1409 
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interaction between atoms or molecules (see Fig. 268). Irrespective 
of the peculiarities of the interaction between particles, the poten- 
tial curve is always”asymmetrical: in the direction of decreasing 
separation between particles 
the curve rises steeply, while 
in the direction of increasing se- 
paration a well wall is formed. 
This is due to the follow- 
ing simple fact: practically, 
the separation between two 
atoms or molecules cannot be 
decreased indefinitely, but il 
can be increased indefinitely — 
at great distances the bond 
between the particles is broken. 
The maximum and minimum 
distance between vibrating 
atoms may be noted on a poten- 
tial curve. The middle of the 
segment connecting the two 
limits corresponds to the aver- 
age position of an atom. When 
the temperature increases from 
Fie. 26 T, to T, the energy of a vibra- 
g. 268 . . : 

ting particle increases and the 


; particle passes over to another 
energy level (see Fig. 268). Since the potential curve is asymmetrical, 


the average position of an atom is displaced to the right. Therefore, 
the average separation be- 


tween atoms will be greater 
than the equilibrium sepa- 
ration between atoms at 
rest, i.e., the 7 correspond- 
ing to the minimum of the 
potential well. Thermal 
expansion results from the 
fact that the average sepa- 
ration between atoms in-. 
creases with temperature. 
The thermal expansion of : 
a crystal is anisotropic, i.e., Fig. 269 
in different directions the 
coefficient of linear expansion a has different values. Therefore, 
when indicating the value of a, the crystallographic direction of in- 
terest should be specified. 
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When it is required lo give a complete description of the thermal 
expansion of a crystal, an expansion diagram may be used. Such 
a diagram is shown for a naphthalene crystal in Fig. 269; a, b, ¢ 
are the crystal axes and Ay, Aq, Airy the axes of symmetry of the 
expansion diagram. The length of a radius vector drawn from the 
origin to a point on the surface of the diagram gives the value of @ 
in the given direction. 

The shape of an expansion diagram and its orientation relative 
to the cell axes accord with the symmetry of the crystal. It cannot 
be otherwise since physical properties must be the same in directions 
connected by symmetry operations. 


In order to determine linear expansion coefficients, we must have at our 
disposal means to measure very small displacements with a high degree of 
accuracy. Instruments for measuring thermal expansion are called dilatometers. 
Interference methods (see p. 358) can provide the required sensitivity (several 
hundredths of a micron or better), but these require perfectly ground speci- 
mens, which are very difficult to Lee 

In practice, quartz differential dilatometers are used. In such instruments, 
the specimen to be measured is placed in a cylindrical holder made of quartz 
glass. At the bottom of the holder a base prism is placed. One end of the speci- 
men rests on this prism. A quartz rod, which transmits the expansion of the 
specimen to the measuring portion of the instrument, rests on the upper end 
of the specimen. The displacement of the end of the rod is measured by means 
of a microscope or rolary mirror. Since the holder as well as the specimen ex- 
ands upon heating, the instrument registers the difference between the coef- 
ficients of expansion of the specimen and the holder. The necessary corrections, 
based on available data on the thermal expansion of quartz over a wide range 
of temperatures, may then be introduced. 

The most accurate method of obtaining an expansion diagram is by means 
of X-ray structural analysis—measurement of the displacements of diffraction 
spots. 

Coefficients of Linear Expansion 


| t°C ax104, deg-1 
Aluminium ...... 0-100 0.238 
Gypsum ........ 12-25 0.025 
Quartz, || to the axis . . 40 0.0781 
Quartz, | to the axis 40 0.1419 
TC@) 550028 , ett cen &. & — 10-0 0.507 


237. Crystal Imperfections 


Block Structure. The structure of an actual crystal differs consid- 
erably from that of an ideal space lattice. This assertion is based on 
numerous facts, including direct electron-microscopic observations. 


39* 
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However, our basic knowledge of inner crystal imperfections is de- 
rived, in the first place, from strength measurements. A crystal will 
rupture when it is subjected to a stress of less than a hundredth of 
the stress which an ideal object should withstand. Deformations and 
the rupture of crystals will be discussed in Chapter XXXIV. 
Here, we shall summarise our present knowledge of crystal imper 
fections. 

Briefly, the situation can be described as follows: A monocrystal 
does not constitute a single lattice. It consists of a large number of 
tiny blocks which are slightly misaligned (within the limits of 
several seconds or minutes) with respect to one another. The dimen- 
sions of the blocks may vary within rather broad limits. In most cases, 


they lie in the range of 10-®- 


AAADAAGOL oogeg 10-*cm. Plotting of the block 
OECD OOO O0CO00 dimensions of a crystal would 
NeLeveyose) CVLEVOICLOKOlS) probably yield a characteristic 
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200000 COO rangement of particles at the 
Helelelelelesevarorao wins pone Ate two blocks. 

, ere 1s good reason to 
Te elolereK9 that a liquid surface covared 


with soap bubbles can serve 
as an excellent model. 
An examination of Fig. 270 


shows that a “fracture” exists in 


the atomic rows close to the 
centre of the model. The portion of the “structure” illustrated in 


the figure can be represented as four blocks with a common corner at 
the centre of the model. An imperfection is clearly visible at the cen- 
tre. Here, the upper “atoms” have not fallen’ into their right places, 
i.e., they do not fit into the empty spaces of the compact packing 
structure. This fault has led to the splitting up of the crystal into 
blocks. us 

Many such faults—called dislocations—exist in a crystal. They 
are distributed randomly and may turn an atomic row to the left 
or to the right. Therefore, on the average, all crystallographic align- 
ments extend through an entire monocrystal with great exactitude. 
Dislocations give a monocrystal a block (or mosaic) structure. To be 
sure, the presence of chance microfissures, or empty spaces, several 
atoms deep, also facilitate the formation of block structures. 

An examination of the figure shows that a dislocation may also 
be pictured as follows: two adjacent rows, one of which has one par- 
ticle more than the other—an extra atom has “found its way” into 
one of the rows. 
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Dislocations. Fig. 270 represents a two-dimensional model of 
a crystal. It is as if each row were the projection of an atomic layer 
which is oriented perpendicular to the figure. The large fault would 
correspond in a three-dimensional crystal to a linear region perpendic- 
ular to the figure. This region may be called the core of the dislo- 
cation. 

Dislocation patterns not only explain the block structure of crys- 
tals, but also many other phenomena. Therefore, it is well to study 
these peculiar distortions of crystals 
in detail. 

There are two kinds of dislocations— 
simple and spiral. The dislocation illus- 
trated by the bubble model is of the 
simple kind. Schematically, such a 
dislocation is illustrated in Fig. 271a. 
The core of the dislocation is designated 
by an inverted T. The distortion is 
maximal near the dislocation plane 
dividing the crystal into two parts 
and rapidly diminishes in either direc- 
tion away from the dislocation line. 


Fig. 271b shows a top view of the two i ' : i ‘ i 
adjacent atomic planes on either side 4! $3 ¢¢ 44444 44 4 
of the boundary between the blocks. Pir Petra? et ug 
The upper, or compressed, plane (des- fi fo te Pet ee dt ¢ ! 
ignated by solid lines) contains one || i ; j i j 
row more than the lower one (desig- ; ‘ 
nated by dotted lines). b) 
Analogous diagrams for a so-called Fig. 271 


spiral dislocation is shown in Fig. 272. 

The lattice is divided into two blocks, one of which has slipped, 
so to speak, a distance of one period relative to the other (see 
Fig. 272a). At the axis shown in the figure, the distortion is max- 
imal. The region adjacent to this axis is called a region of spiral 
dislocation. It will be easier to grasp the essence of this distortion 
if we examine the diagram of the adjacent atomic planes on either 
side of the boundary between blocks (see Fig. 272b). This is the right 
view of the three-dimensional figure. The spiral dislocation axis is 
the same as in the three-dimensional figure. Solid lines denote the 
plane of the right block and dotted lines those of the left. As may 
be seen from the diagram, a spiral dislocation differs from @# simple 
dislocation. There is no extra row of atoms in this case. The distor- 
tion consists in the fact that the atomic rows change their closest 
neighbours near the dislocation axis, i.e., they bend and drop down 
One level. 
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Why is this called a spiral dislocation? This can be explained as 
follows. Let us move around the dislocation axis—along the nodal 
planes of the lattice—beginning at the lowest plane. After each rev- 
olution, we are one level higher. In this manner, we reach the top 
of the crystal, much the same as having climbed a spiral staircase. 
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Fig. 272 


In Fig. 272, the spiral motion would be counterclockwise. If the 
blocks were displaced in the opposite direction, the spiral motion 
would be clockwise. 

In a given object, one may encounter successive spiral dislocations 
having the same rotation direction. If two dislocations having 
different rotation directions are in the same plane, the resulting 
distortion is more complex. ; 

Imperfections within a Block. A crystal latlice may consist of 
blocks which also have imperfections. These imperfections may be 
in the form of lattice vacancies or foreign atoms. A very small num- 
ber of lattice vacancies and foreign atoms can result in considerable 
distortion. 

Fig. 273 shows the nature of these distortions. In (a) we see the 
effect of a foreign atom that replaces one of the basic atoms of a lat- 
tice, in (b) the effect of a foreign atom penetrating between basic 
atoms, and in (c) the effect of a lattice “vacancy”. The disturbance 
may extend to a depth of 5-10 lattice spacings in each direction. 
This encompasses a region of the order of 1,000 cells. Therefore, an 
impurity of the order of 0.4 per cent may fundamentally change 
the properties of a crystalline substance. (It should be noted, how- 
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ever, thal an impurity does not produce appreciable lattice distor- 
tion.) In Sec. 272 we shall deal with the case of semiconductors in 


which impurities of the order of one 
part in a thousand million may change 
the electrical properties of a body. 


238. Short-Range Order. Liquids 


It was seen at the beginning of this 
chapter that very many solids can be 
represented as compact packings of 
spheres. In such structures, the fraction 
of the toLal volume consisting of emply 
space is equal to 26 per cent. 

Copper is an example of such a 
crystal. How does the structure of a 
piece of copper change when it is 
melted? Experiments show that the 
volume increases by about 3 per cent. 
This increase is due to an increase in 
empty space, which now equals 29 per 
cent of the total volume instead of 
26 per cent. The compact structure 
has loosened and the spheres are able 
to move away from their “proper” posi- 
tions. The ideal order which is charac- 
teristic of a crystal has been dis- 
turbed. 

As a result of thermal motion, the 
spheres vibrate, in general, about their 
equilibrium positions and remain sur- 
rounded by the same neighbours. Now 


and then, neighbours may change when 


a space of the same size as the volume 
of a sphere happens to be created in 
the vicinity of the sphere. Owing to 
the closeness of particles in a liquid, 
a so-called short-range order arises. In 
a model of spheres, one sphere cannot 
approach another closer than the diam- 
eter of a sphere. Such a deviation 
from ideal randomness occurs in gases 
as well, but it is of little importance 
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Fig. 273 


there since the closest molecules in a gas are separated, on the average, 
by a distance ten times as great as the dimensions of a molecule. 
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Let us examine a molecule in a liquid and imagine two concen- 
tric spheres about it. Assume the radius of one is equal to that of the 
molecule and the radius of the other to three Limes this value. On the 
average, how many close neighbours does such a molecule have? By 
close neighbours we mean molecules located in the region between 
concentric spheres. Consider the example of copper, the volume of 
which increases by 3 per cent when it is melted. According to cal- 
culations there are, on the average, 11.6 atoms in the region under 
consideration. Thus, there are to be found about 12 close neighbours 

the centres of which are separated from 


Vir) the given atom by a distance equal Lo ils 
2 diameter. Closer neighbours are not to be 
found. 


It is clear that the short-range order 

affects not only close neighbours but suc- 

2 4 6 86 1 rA cessive ones as well. Therefore, it is cus- 
- tomary to describe the short-range order 

Fig. 274 by the average density of radial distri- 

bution of atoms. 

Let us imagine two concentric spheres of radius r and r + dr 
about an atom. For simplicity, assume we are dealing with a mona- 
tomic liquid. The volume of this spherical shell will be 4nr2dr. The 
number of atoms falling within this shell may be expressed as 


U (r) x 4nr? dr, 


where U(r) is the density of radial distribution of atoms. 

A U(r) curve for amorphous arsenic is shown in Fig. 274. Maxima 
of the curve indicate that certain interatomic spacings acquire con- 
siderably more “weight” than others. The origin of successive maxima 
is exactly the same as of the first. The density of packing is such as 
to allow the number of closest neighbours of a given atom to fluc- 
tuate only within very narrow limits, while the number of neigh- 
bours closest to the closest may fluctuate within somewhat broader 
limits. As the distance from the central sphere increases the random 
order becomes more and more evident. The U(r) curve approaches 
a limit, i.e., the short-range order fades away and gradually passes 
over into a random order. It is convenient to set the value ofU(r) 
equal to unity at r > oo. This distinctive order with regard to close 
neighbours, which fades away as the distance from the atom or 
molecule under consideration increases, is what is meant by a short- 
range order. 

The order in the arrangement of particles characteristic only of 
crystals is called a long-range order. This means that the three-di- 
mensional periodicity peculiar to a crystal does not fade away at 
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great distances. The arrangement of atoms along a nodal line regu- 
larly repeats ilself thousands and millions of times. 

When we discussed the structure of crystals, we saw that atoms by 
no means always behave like spheres. This applies to liquid struc- 
lures as well. 

In an ideal case, the short-range order in an atomic liquid should 
result in the number of close neighbours being equal to almost. 
twelve. Experiments show that metals the crystal structures of which 
consist of compact arrangements of spheres continue to have such 
a short-range order, i.e., an average number of close neighbours 
just short of twelve, after being melted. 

As indicated above, every atom of lithium, sodium and potassium 
in a crystal has eight close neighbours. The same short-range order 
is preserved in a liquid, but the average number of close atoms be- 
comes somewhat greater than eight. 

Simple substances in the crystalline state of which the atoms are 
firmly bound to a small number of neighbours behave differently. 
These bonds are broken when such substances are melted and the 
number of close neighbours per atom of fluid becomes greater than 
in a crystal of the same substance. 


239. Amorphous Bodies 


The word “amorphous” means “without form”. Amorphous solids 
are the antitheses of regular polyhedral crystals. However, the shape 
of a polycrystalline body is not regular even though it is not amor- 
phous. How then may crystals and crystalline bodies be recognised? 
They may be recognised, primarily, by their well-defined melting 
points. If heat is applied to a crystalline body, the temperature of 
the body increases until it begins to melt. Thereupon the tem- 
perature ceases to rise and the entire melting process takes place at. 
the melting temperature. 

Ordinary glass is a typical amorphous solid. It grows soft when 
it is heated and gradually goes over into the liquid state as the tem- 
perature is raised. 

This behaviour’of amorphous bodies can be explained by their 
structural peculiarities, leading to the classification of such bodies 
as liquids rather than solids. 

As indicated, there exists a long-range order of particles in crystal- 
line bodies. In amorphous bodies, only a short-range order of particles 
occurs and in this respect such bodies do not differ from liquids. 

Fig. 275a shows the structure of quartz (silicon dioxide), and 
Fig. 275b the structure of quartz glass. From the chemical viewpoint, 
a substance can be obtained in a crystalline form as well as in an 
amorphous form. The similarities and differences between these two 
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states can be clearly seen in the figure. Apparently. an amorphous 
body is a “damaged” crystal. In a crystal and an amorphous body, 
the number of close neighbours and the nature of the encirclement 
are the same. Possibly, a pentagonal ring is particularly advanta- 
geous energetically for SiO, groups. Since the symmetry of an axis of 
fifth order cannot produce a periodic structure (sec p. 592), amor- 
phous glass is obtained. 

The absence of a long-range order, which is characteristic of crys- 
talline bodies, is the immediate cause for the absence of a well-de- 


Fig. 275 


fined melting point. At the melting point, a transition occurs and 
the long-range order disappears. Only a short-range order in the 
arrangement of atoms remains. 

In amorphous bodies, the nature of the arrangement of atoms does 
not change when the temperature is increased. Only the mobility 
of the atoms changes, i.e., the atomic vibrations increase. At first 
only a few atoms are able to escape from lheir encirclement and 
change neighbours. This number gradually increases until, finally, 
the rate of such changes becomes the same as in water. 

The ease with which a given molecule may change its neighbours 
is related to an important property of a liquid, namely, its viscosity. 
The less frequently neighbours are changed in a liquid, the thicker, 
i.e., the more viscous, the liquid. Of course, an increase in tempera- 
ture, which increases the swing of molecular vibrations, results in 
a decrease in viscosity. It is also quite understandable that, tempera- 
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ture condilions being equal, the liquid whose molecules are more 
complex will be more viscous. Many liquids harden before they be- 
come very viscous. The high viscosity of glue, honey, tar and oil is 
due to the complex form of their molecules. 

When aliquid hardens, the exchange of molecules practically ceases. 


240. Short- and Long-Range Order of Atoms in Alloys 


When two or more substances, crystallise together, they may in 
certain cases form acommon crystal lattice. It depends on the relative 
values of the energy of interaction between homogeneous and heter- 
ogeneous particles whether such a mixed crystal is formed or not. 
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If the attraction between homogeneous particles is greater than be- 
tween heterogeneous particles, a mixed crystal is not formed. 

Metal alloys, which are widely used in various industries, are 
mixed crystals. By reference to the structure of alloys, we can 
clarify the concepts of short- and long-range order. 

In the simple case of diatomic alloys, we may encounter perfectly 
ordered structures in which a definite cell can be distinguished and 
the substance described as a crystal of a compound with the definite 
formula A,B,,. However, this does not always occur and in a num- 
ber of cases A atoms randomly replace B atoms in their lattice or, 
if they are small, randomly become lodged between B atoms. 

We shall discuss only a substitution alloy, namely, iron-cobalt 
(see Fig. 276). This alloy has a simple body-centred lattice structure. 
Each atom—iron as well as cobalt—has eight close neighbours. As 
regards the arrangement of atom centres, an alloy crystal is always 
perfectly ordered, i.e., the atom centres form the same body-cen- 
tred lattice under all conditions. The situation differs with respect 
to the distribution of iron and cobalt atoms. Let us consider the 
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cell points of a crystal to be divided into corners and centres of cubes. 
For perfect order, all corners are occupied, say, by iron atoms and 
all centres by cobalt atoms (Fig. 276a). The ideal long-range order 
of such a crystal may gradually deteriorate if atoms begin to occupy 
“foreign” sites. But as long as the number of atoms located at. their 
“own” sites differs from the number of atoms localed at “foreign” 
sites (Fig. 276b), the crystal may be said to have a long-range order, 
even though the order is, in part, “impaired”. The long-range order 
disappears when the distinction between “foreign” and “own” siles 
is lost, i.e., when half the atoms are at their “own” sites and half 
at “foreign” sites (Fig. 276c). 

It is important to note that when a completely ordered crystal 
is heated, the order is gradually destroyed, i.e., the percentage of 
atoms at “foreign” sites increases. There exists a temperature above 
which even a partially “impaired” long-range order cannot exist. 
This temperature is called the A-point (lambda-point). For an iron- 
cobalt alloy, the A-point is 770°. The transition from order to dis- 
order signifies that thermal motion has gained the upper hand over 
the “tendency” of atoms to maintain a long-range order. 

There is a great deal in common between the process of eliminat- 
ing the distinction between “foreign” and “own” siles and the melting 
process. Both processes result in the disappearance of a long-range 
order. However, melting results in the disappearance of the long- 
range order of atom centres, while passing through the A-point 
results in the disappearance only of the order in the arrangement 
of atoms of different elements. 

The basic characteristic of the structure of alloys of the iron- 
cobalt type is the possible existence of a partial long-range order. 
Such a partial long-range order can exist only with respect to the 
distribution of the iron or cobalt atoms, but not with respect to 
the arrangement of atom centres. 

Like in the case of melting, the elimination of a long-range order 
does not mean the elimination of order in general (a short-range 
order remains). 

The short-range order with respect to the distribution of atoms in 
iron-cobalt crystals consists in the “tendency” of cobalt atoms to 
surround themselves with iron atoms, and vice versa. If we take any 
atom and examine its eight close neighbours, we will find that the 
number of atoms of the other element will not be equal to one half 
of the total, i.e., four. Depending on the degree of perfection of 
the short-range order, an iron atom may be surrounded, on the aver- 
age, by five, six or seven cobalt atoms. 

The investigation of a copper-gold alloy shows that its short-range 
order has a high degree of perfection. This pertains not only to the 
number of closest neighbours, but to the number of closest to the 
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closest, etc. If a number of spheres are drawn about any of the gold 
atoms, it is found that the first shell contains, in effect, only copper 
atoms, while the second contains only gold atoms. In successive 
shells, the short-range order begins to deteriorate, but a predilec- 
tion for atoms of a definite element will be felt as far away as the 
tenth shell! 

It has been determined by means of very precise investigations 
using X-rays how a long-range order in alloy crystals is “created”. 
Experiments with cobalt-platinum alloys have shown that the re- 
gions of long-range order grow in a disordered crystal as crystal nu- 
clei grow in a liquid. These embryonic regions are arranged in a per- 
fectly definite manner relative to the axes of a crystal. 


241. Liquid Crystals 


To find examples of liquid crystals, one must turn to organic sub- 
stances. Molecules of substances forming liquid crystals are always 
elongated. Liquid crystals are encountered among viruses, and also 
among li oids, which are components of living tissue. 
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A substance forming liquid crystals exists as such in a definite 
temperature range. If a liquid crystal is heated, it turns into an 
ordinary liquid; if it is cooled, it becomes a crystal. 

The term “liquid crystal” is derived from the strange manner in 
which the properties of a liquid and a crystal are combined. A liq- 
uid crystal possesses fluidity and forms drops. However, these drops 
may be elongated rather than spherical, and,somewhat resemble jel- 
ly. Careful investigation shows that the order of molecules in such 
a drop is not like in ordinary liquids. 


Two kinds of liquid crystals are known. In one, the molecules are 
in a short-range order and parallel to one another. In the other, the 
order of molecules is even more peculiar. Here, the molecules are 
arranged in layers. Each layer consists of parallel molecules, which 

are in a short-range order. This is illus- 
' trated in Fig. 277. 

1 A soap solution consists of liquid crys- 
tals. The washing properties of soap are 
directly related to its ability to form 

i liquid crystals. A moleclue of soap has an 
I i elongated shape (transverse axis measur- 
—_— — —__ —~ = ing about 4 A and longitudinal axis 


—_ 30-40 A). At one end of a molecule a 

SS) ee negative electric charge is concentrated 
canes and this pole is attracted by water mole- 

* cules. A soap solution is a liquid crystal. 

i i ? 7 It consists of a large number of double 
layers of molecules, separated by layers 

i } I i I of water (see Fig. 278). In the double 
layers, the poles of the molecules are 

_—_- — turned outward, i.e., toward the water. 


oS The molecules of soap within a layer are 
——_—_— ——— _inaclose arrangement and ina short-range 
SS ee order. If there is little soap in the water, 
Fig. 278 the double layers of soap molecules are 
separated by large layers of water. AS 
more soap is put into the water, more and 
more double layers will be produced. The solution becomes saturated 
when the thickness of a water layer equals about 20 A. The double 
layers forming a liquid crystal possess great mobility. When we 
wash our hands, the layers slide easily relative to one another and 
the skin. Dirt from our hands collects at the poles of the molecules 
and is then released in the water. 


242. Polymers 


A large number of organic substances, composed of giant molecules 
consisting of thousands of atoms, have a peculiar structure. Such 
substances include plastics, kapron and artificial silk. The molecules 
of these substances consist of identical groups of atoms arranged 
in a chain (whence the term “polymer”). The atoms within a mole- 
cule are frequently in a long-range order. 

The properties of high-polymers having lateral chemical bonds 
between chains differ significantly from those of so-called linear 
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polymers, in which such bonds do not exist. Polymers having later- 
al chemical bonds between chains are rigid systems. Their atoms 
are arranged rather loosely and in a completely haphazard manner. 
Plastics of such polymers are used in the manufacture of buttons, 
kitchen utensils and various fittings. Linear polymers have interest- 
ing properties and structures. 
Although a number of points still Dp Lh yy A; 


remain unclear, the basic struc- Le ¥ Veg ii 
tural features of these substances LEEL 
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come arranged in parallel streams. 
Since solidification begins simul- 
taneously from a large number 
of centres, numerous stacks of 
chains, which collide during 


growth and pass round one anoth- YA 
er, are formed. As a result, the CX «79 £7 
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The role of a stack in a linear 6) 
polymer is somewhat analogous te 
the role of a crystalline particle 
in a polycrystalline substance. og 
Nevertheless, there is a_ signifi- ‘a ) er 
cant difference between the two, QI 
for the degree of order of a group iS) 4) nd b; 
of parallel chains forming a stack ‘e C > ror 
(consisting of thousands or tens of c) 
thousands of such chains) may vary 
considerably from case to case. Fig. 279 


Fig. 279 shows three kinds of order: 

a) crystal type—the axes of the chains form a perfect lattice and the 
azimuths of the chains are ordered; b) gas-crystal type—the axes of 
the chains forma lattice and the azimuths are disordered; and 
c) amorphous or liquid type—no lattice is formed (absence of long- 
range order). It should be noted that the displacements of the chains 
relative to one another in the longitudinal direction also may be 
ordered or disordered. Since a stack of chains, each of which con- 
sists of hundreds of thousands or millions of atoms, is very long 
and will constitute, therefore, a broken formation, it is evident that 
even in the case of ideal order in the arrangement of chains the order 
of a stack is not entirely like that of a crystal. Only the portion of 
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a stack for which the parallel chains are rectilinear can have a crystal 
arrangement. This means that in the case of ideal order each stack 
of chains consists of a sequence of crystalline regions (crystalline 
particles). 

The stretching of linear polymers consists in the unfolding of 
stacks of chains. A similar mechanism of elongation enables us to 
explain extensions of up to 1,000 per cent occurring in certain nal- 
ural and artificial high polymers. 

Rubber and polyethylene are the best known linear polymers. 
The high polymers used in the manufacture of artificial fibre also 
have the same kind of structure. 


CHAPTER XXXIII 
PHASE TRANSFORMATIONS 


243. Phase Diagrams 


A substance can have only one gas state and one liquid state, but 
it can have several crystal states (also, several liquid-crystal and 
gas-crystal states). 

The gas and liquid states of a substance are characterised by disor- 
der in the arrangement of particles. Ina gas, the ratio of the kinetic 
energy of the particles to their potential energy of interaction is such 
that the binding forces cannot restrain the particles from flying apart 
to the extent to which the vessel containing the gas permits. The 
liquid state has a definite form since the binding forces do not permit 
the molecules to have independent free paths.* At high pressures, 
the distinction between a gas and a liquid disappears. 

Since two basically different random arrangements of particles 
do not exist, every substance has one liquid and one gas state. A crys- 
tal is characterised by a definite arrangement of particles and, in 
principle, an infinite number of different crystal phases can exist 
for a given substance. In actuality, several different crystal phases — 
exist, as a rule, for one and the same chemical compound (diamond 
and graphite, white and gray tin, yellow and red sulphur, etc.). 

Every substance exists in one or another phase, depending on the 
external conditions, viz., the temperature and the pressure. It is 
customary to use a pressure-temperature diagram, instead of a 
table, to describe the conditions for the existence of the various 
phases of a given substance. A diagram of this kind is known as 
a phase diagram. 

Three such diagrams are shown in Fig. 280. In the upper left-hand 
corner, we see a phase diagram for an ideal substance having only 
one solid phase. The diagram is divided into three regions:. one re- 
gion indicates the conditions for the existence of a crystal, another 
for the existence of a liquid, and the third for the existence of a gas. 
The gas state, of course, is represented by the lower right-hand _por- 
tion of the diagram, i.e., where the temperatures are low and the 
pressures high. The solid phase is represented by the region of lowest 
temperatures and highest pressures. Such a diagram is very conveni- 
ent. In order to determine the state of a body at a pressure p and 


* In the absence of gravitational forces, a drop of liquid is spherical. 
40-1409 
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a temperature 7, all we need to do is find this point on the diagram 
and observe in which region it is located. 

In the upper right-hand corner of Fig. 280, the conditions forfthe 
existence of the various phases of sulphur are shown. This substance 
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Fig. 280 
has two crystal phases and therefore the diagram is divided into 


four parts. In the lower part of Fig. 280, the phase diagram for water 
is shown. Since it is difficult to represent such a diagram drawn to 
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linear scale on a single drawing, the pressure in this diagram has 
been plotted logarithmically. It will be seen that ice exists in five 
different phases, which are designated by the Roman numerals I, IJ, 
I1I, Vand VI. The phase which was originally designated as IV turned 
out to be an error. The less common phases of ice exist at higher 
pressures. 

On a phase diagram, dividing lines between phases, as well as 
points within the various regions of the diagram, have physical sig- 
nificance. At pressures and temperatures corresponding to points on 
the dividing lines, two boundary phases exist simultaneously. In 
the case df water, this would correspond to the condition of ice 
floating on water, with theice not melting and the water not freez- 
ing. The dividing lines may be called phase equilibrium curves. 

It should be emphasised that these are phase equilibrium curves 
and not points. This means that equilibrium between two phases 
can be realised at different temperatures if the pressure is varied 
accordingly. This may also be expressed as follows: the temperature 
of phase equilibrium is a, function of pressure; or, the pressure of 
phase equilibrium is a function of temperature. 
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Phase equilibrium curves may also be called phase transformation 
curves since transition from one phase to another occurs when this. 
line is crossed. 

The dividing line between a solid and a liquid is the fusion, or 
crystallisation, curve; and the dividing line between a liquid and 
a vapour is the vaporisation, or condensation, curve. We call the 
dividing line between a solid and a vapour the sublimation curve and 
the lines between two solid phases simply transformation 
curves. 

Processes involving a change of state are also conveniently indi- 
cated on a phase diagram. Usually, we are concerned with transfor- 
mations occurring at constant temperature or at constant pressure. 
These processes are represented on a diagram by vertical and hori- 
zontal lines, respectively. 

Several phase transformations are illustrated in Fig. 280. Line 
2-/ on the phase diagram of sulphur represents a cooling process for 
a sulphur gas at normal pressure. At'a temperature of 444.5°C sul- 
phur is transformed from a gas into a liquid, at 110.2°C from a liq- 
uid into a crystal phase and, finally, at 95.5°C from this crystal 
phase into another crystal phase. The compression of sulphur gas is 
illustrated in the same diagram by the process 3-4. By increasing the 
pressure, we are able in this case too to transform a gas into a liquid 
and then, at very high pressures (above point 4) into solid states. 
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Under certain unique conditions, three phases may exist together 
simultaneously. Such points are called triple points. Sulphur has 
three triple points: 1) gaseous, liquid and yellow sulphur existing 
simultaneously; 2) gaseous, liquid and red sulphur existing simul- 
taneously; and 3) a liquid existing in equilibrium with the two 
crystal phases. 

It can be proved on the basis of strictly thermodynamic principles 
that a quadruple point cannot occur. Thus, no conditions exist under 
which, for example, two crystal phases are in equilibrium with 
a liquid and a vapour. 

Every phase transformation is characterised by a transition tem- 
perature at a given pressure. We speak of the melting (crystallisa- 
tion), boiling, sublimation, etc., points of a substance. If the pres- 
sure is not indicated, this usually means that the transformation 
occurs at normal atmospheric pressure. 

An important characteristic of a transformation is its heat of tran- 
sition. The occurrence of latent heat of vaporisation and heat of 
fusion is well known, but heat of transition is a general phenome- 
non. A transformation which proceeds by heating absorbs heat. In 
accordance with the second law of thermodynamics, heat of trans- 
formation is uniquely related to a change in entropy: 


AQ=TAS, 


where 7 is the transition temperature. Therefore, it is evident that 
a phase transformation which proceeds by heating is accompanied 
by an increase in entropy. 


The transition (fusion, boiling) temperature can be calculated from 
the formula 
_ AQ 
Das 


ty 


i.e., it is equal to the quotient of the latent heat of transition divid- 
ed by the increase in entropy. But in this form the statement is of 
purely theoretical significance since practically the change in entro- 
py for a phase transformation cannot be precalculated. However, 
knowing the transition temperature and heat of transition from 
experiments, one-can determine accurately the magnitude of the 
increase in entropy. - 


The phase transition from ice I to ice III occurs at a temperature ¢ = — 20°C 
and a pressure p = 2,103 atmospheres. This transition is accompanied by the 
release of heat; each gram of ice releases AQ = 5.6 cal. Therefore, the change 
in entropy is 


As =42 _ 5-5 _ 9 002 cal/deg. 
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245. Phase Stability 


How do we explain the fact that under certain conditions a body 
constitutes a liquid and under others a solid? There are two tenden- 
cies which determine the nature of a state under given external con- 
ditions: first, the tendency of a body to possess a minimum of energy 
and, secondly, the tendency to possess a maximum of entropy. The 
first tendency is a consequence of the fact that a system of molecules 
behaves like any system of mass points subject to the laws of New- 
tonian mechanics and, as we know, a mechanical, system tends to 
possess a minimum of potential energy. The second tendency is 
a consequence of the second law of thermodynamics. 

During the transition from a gas to a liquid and during the transi- 
tion to a solid, the internal energy of a substance decreases. The ener- 
gy of a gas is higher than the energy of a liquid since in passing from 
a liquid to a gas work must be expended in overcoming the binding 
forces between molecules. And the energy of a crystal is lower than 
the energy of a liquid since an ordered arrangement of interacting 
particles is always more stable than a disordered arrangement. This 
can be proved rigorously, but the proof will not be presented. The 
statement appears rather obvious. Imagine, for example, a perfect 
lattice of spheres connected by means of springs. Any displacement 
of a sphere requires a certain amount of work. Hence, an ordered 
arrangement corresponds to a minimum of energy. 

Entropy behaves in a different manner. Roughly speaking, the 
greater the freedom of motion of the constituent particles of a body, 
the greater its entropy. A disturbance in the order or an increase in 
the separation between particles results in an increase in entropy. 

Thus, for a given temperature and pressure, the state of a substance 
is established as a compromise between entropy and energy. Using 
the second law of thermodynamics, we can obtain a quantitative 
expression for this general law. 

Imagine that a body is placed under “foreign” conditions, i.e., 
ice under the conditions for the existence of steam, etc. In such 
a case, an irreversible phase transformation (fusion, vaporisation, 
etc.) will occur in accordance with the second law of thermodynam- 
ics: the increase in the entropy of a body will be greater than the 
applied reduced heat, 


dQ 
dS > TT . 
Using the first law of thermodynamics, we can rewrite the inequali- 
ty in the form 


dS > Ut P® and dU—T dS +pdv<0. 
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Since the phase transformation occurs at constant temperature, we 
obtain 


d(U—TS)+pdv<0. 


If the process is not accompanied by a change in volume, the 
transition to a state of equilibrium takes place with d(U —,TS) < 
< 0, i.e., with a decrease in the quantity F = U — TS. This func- 
-tion is called the free energy. We have shown that aspontaneous phase 
transformation is accompanied by a decrease in free energy, i.e., 
the free energy of a stable state must be a minimum. 

If the process occurs at constant pressure, the transition to an 
equilibrium phase takes place with d(U — TS + pv) <0, i.e., 
with a decrease in the quantity D = U — TS + pv. This function 
is called the thermodynamic potential. Thus, a phase transformation 
at constant pressure is accompanied by a decrease in thermodynam- 
ic potential, i.e., the thermodynamic potential will have a mini- 
mum value at equilibrium. 

The opposing tendencies of entropy and internal energy are brought 
out in this statement: a decrease in energy and an increase in entropy 
result in a decrease in free energy or thermodynamic potential. 
These two tendencies have been expressed quantitatively in the 
relations showing that F and ® tend to a minimum. 

The formulated condition for phase equilibrium has numerous 
applications. For example, using this condition, we can derive a 
relation for the slope of a phase equilibrium curve. 

On such a curve, consider two points representing the external 


conditions T;, p, and T2, p2. The equilibrium conditions f 
points have the form s lor these 


Dy (Ts, Py) = Dy (Ty, py) and WD, (T2, pe) =, (T2, Pe). 
The subscripts of ® refer to the phases in equilibrium. Subtracting 
the first equation from the second, we obtain 
Ds (Lay Po) — Dy (T1, Pr) = Pe (2, Po) — De (74, py). 


Let us assume that the two points are close to each other, Then, 
by means of the formula for the increment of a function of two 
variables, we can transform the above equation into the form 


a0, am, - ADe a2 4 


ap Of +a, P= ap AE +a oP 
Substituting the values of the derivatives of the function ® = Y _ 
a@ aD _ ' 
— TS + pv, namely, 7 S and @p Us we obtain 


dp _ S;—S__ AS 
dT — yg 4 — V2 
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But since A =4¢ ; 


dp _ AQ ee . 7 
qe T Gon (Clapeyron-Clausius equation). 


Thus, the slope of the curve (the derivative ) is determined 
by the latent heat of fusion AQ, the temperature of the phase transi- 
tion 7, and the difference in the volume of the phases. If AQ is posi- 
tive, this means that the subscript 1 refers to the high-temperature 


phase. 


Let us apply the Clapeyron-Clausius equation to the case of melting ice. 
When ice melts, 1 cm? of water is obtained from 1.091 cm® of ice. The volume 
change vy — ve is equal to —0.091 cm? (the volume decreases). In this case, 
AQ will be the heat of melting and is equal to 80 cal/gm. The temperature 7 
equals 273°K. Hence, 


dT 1 XAv___ (273)(—0.091) _ 


3 
at 9.34 2g om? 
dp 


AQ 80 cal ° 


Since the dimensions of the above result somewhat obscure its significance, 

let us convert calories to atmospheres, recalling that 1 cal = 42.7 kg cm ~ 

~ 42.7 atm cm’. We obtain 
OTe 96015 ee 
dp 


atm 


Thus, increasing the pressure by 1 atm decreases the melting point of ice by 
0.0075 degree. 
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The above thermodynamic explanation of phase transition phe- 
nomena does not explain a number of observed facts. Thus, from the 
thermodynamic viewpoint, for a given p and 7 there can occur a single 
state (a point in one of the regions of a phase diagram) for which 
the free energy, or thermodynamic potential, assumes a minimum 
value. However, it is possible for graphite and diamond to exist side 
by side, and water can be obtained under the conditions for the 
existence of ice (supercooled water). Numerous other examples in 
which the above thermodynamic principles are violated may be cited. 
The situation can be described as follows: In addition to states 
which are stable under given external conditions, so-called meta- 
stable states may also exist. 

The free energy of a metastable state is not a minimum, but nev- 
ertheless the transition from this state to a state having a minimum 
energy is impeded. Different metastable states may differ considera- 
bly in their degree of stability. Sometimes a slight impulse suffices 
for a transition to occur to a “normal” state, while in other cases a 
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a metastable state may be, in actuality, no less stable than “normal” 
state. 

Various phase transformations can be delayed. Thus, water can be 
supercooled, i.e., at normal pressure, water may exist at a tempera- 
ture below 0°C; water can also be superheated, i.e., its temperature 
may be raised above 100°C without boiling. A vapour also may be 
obtained under atypical conditions (a supercooled vapour is said to 
be supersaturated). Transformation delays always occur in the solid 
state, i.e., the transformation of one crystal phase into a-second is 
delayed even though the conditions prevailing are those for the 
stable existence of the second phase. 

However, one type of transformation, namely, fusion, is never 
delayed. Thus, a crystal cannot exist under conditions that are 
stable for the liquid phase. 

We frequently have occasion to deal with supercooled liquids. 
Liquids such as glycerine considerably increase in viscosity when 
supercooled and may remain in the amorphous state for months or 
even years. Glass is another example of a supercooled liquid. 

The existence of a metastable state can be demonstrated in the 
case of a supercooled liquid by bringing the liquid into contact 

with a crystal. In such cases, crystallisation begins immediately. 
If the liquid is highly supercooled, the effect will be extremely vio- 
lent. When a snowflake is thrown into supercooled water, ice nee- 
dles dart through the water in all directions and in a few seconds 
the transformation is complete. 

_ Delays of crystal-crystal transformations are particularly interest- 
ing. Here, delays can occur, so to speak, in both directions. Yellow 
sulphur should be transformed ‘into red sulphur at 95.5°C. If sul- 
phur is rapidly heated, this transformation point may be “skipped” 
and the sulphur may be brought to fusion at a temperature of 113°. 
Now assume that the melt is gradually cooled. At 113° small crystals 
of red sulphur are formed. Cooling does not result in a transformation 
at 95.5°, and even at room temperature small crystals may exist for 
a considerable period of time. However, the transformation process 
proceeds, even though slowly, and within a day is complete, i.e., 
a yellow powder is obtained. Here, too, the metastable nature of 
the state is best demonstrated by dropping a small crystal int 
the melt. : 

In certain cases, we are interested in a substance in a phase which 
might be expected to exist under entirely different conditions. An 
example of this is white tin which is transformed into gray tin when 
the temperature is reduced to 13°C. Usually, we are more interested 
in white tin and are cognisant of the fact that in winter nothing can 
be done with it. However, white tin excellently withstands 20-30° 
of supercooling and only under severe winter conditions does it 
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begin to be transformed into gray tin. (The members of the Scott 
expedition to the South Pole perished as a result of ignorance of 
this fact. Liquid fuel taken on the expedition had been placed in con- 
tainers soldered with tin. At the extremely low temperatures prevail- 
ing, the white tin was transformed into a gray powder. As a result, 
the containers opened and the fuel was lost.) 

To explain transformation delays, let us consider the difference: 
between liquid-crystal and crystal-crystal transformations on the 
one hand and crystal-liquid transformations on the other. In the 
last case the long-range order of atoms disappears, while in the 
first two a long-range order is created. The elimination of long- 
range order does not require a great effort. Fusion begins at the sur- 
face; atom after atom is torn away from its neighbours and falls out 
of strict order. 

On crystallisation, short-range order is transformed into long- 
range order. The process begins at the surface and must proceed in- 
wardly, i.e., into the substance. The atoms, or molecules, are “forced” 
to establish strict order under extremely crowded conditions. Their 
motions must be harmonised for order to be established. As we have 
seen, the rearrangement of atomic order, which requires that atoms. 
undergo “organised” displacements from certain ordered positions 
to others, is all the more difficult. 

Transformations in the solid state always begin at the boundaries 
of grains, blocks, empty spaces, and at dislocations; in other words, 
wherever there is more freedom. If only several score atoms have 
occupied positions corresponding to a new order, oriented growth 
of the nucleus proceeds, i.e., one after another atoms begin to pass. 
from the old, less favourable order, or in the case of crystallisation 
from disorder, to the new order. This is the effect of a crystal parti- 
cle, or seed, which invariably puts an end to a supercooled state. 
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Vaporisation consists in the separation of fast-moving particles 
from the surface of a liquid. Two conclusions immediately follow 
from this, namely, vaporisation increases with increasing tempera- 
ture and requires the application of heat. If the vaporised molecules 
are continuously removed from the surface of the liquid, the vapor- 
isation process continues until all of the liquid has been trans- 
formed into vapour. 

Let us consider vaporisation in a closed vessel. In such a case, not. 
only do molecules separate from the surface of a liquid, but the re- 
verse process also occurs, namely, vapour molecules return to the 
liquid. The vaporisation process will continue until dynamic equi- 
librium corresponding to the given temperature has been estab- 
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lished. Of course, the liquid may completely vaporise without equi- 
librium being established with the vapour. 

When equilibrium exists, we say that a vapour is saturated. The 
pressure of a saturated vapour is a function of the temperature and 
is given by the phase equilibrium curve. By changing the tempera- 
ture, we either vaporise more of the liquid in a vessel or condense some 
of the vapour. This results in a 
change of the vapour pressure. 

It is clear why the density and 
pressure of a saturated vapour in- 
crease with temperature. The num- 
ber of molecules leaving a liquid 
rapidly increases as the kinetic ener- 
gy of the molecules increases. On the 
other hand, the number of vapour 
molecules returning to a liquid is 
almost independent of the tempera- 
ture since such a process requires no 
energy. 

The density of saturated vapour 
at a given temperature varies from 
substance to substance within a broad 
range. At room temperature, the 
density of saturated steam is equal 
to 13 mm, while that of saturated 
mercury vapour is only 0.005 mm. 

A clear picture of transition 


.,.... ,.. processes from gases to liquids may 
be obtained by considering isothermic compression of a gas, i.e., 


“vertical” processes in a phase diagram. In order to represent volu- 
metric changes, which are not shown in a phase diagram, let us draw 
an auxiliary diagram in which pressure is plotted as a function 
of volume (see Fig. 281). 

If gas compression occurs at a sufficiently low temperature, sooner 
or later we arrive at an intersection point with a phase equilibrium 
curve. At this instant, the pressure is equal to that of saturated 
vapour at the temperature of the experiment and the first drops of 
liquid appear. As long as a vapour is not completely transformed into 
liquid, the compressing motion of the piston will not be accompa- 
nied by achange in pressure since we remain at the same point in the 
phase diagram from the beginning to the end of condensation. The 
condensation process will be indicated by a horizontal line on the 
pressure-volume curve. 

The significance of points of a rectilinear segment on the diagram 
is clear. They describe a two-phase liquid-vapour system. Hach 


Fig. 281 
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point of such a segment corresponds to a definite ratio between the 
phases, which can be easily determined by means of a “lever” rule. 
Let us designate the volume of the liquid by v4, the volume of the 
vapour by v2, and the proportion of substance in the liquid state 
by z. Then, the volume of the wet mixture is given by 


v=wey,+(1—2) ve. 
Hence, 
_ vev 
~~ vg—ty * 
This is the “lever” rule. 

When the condensation process is completed, a steep rise occurs 
in the curve since liquids have very low compressibility. 

Now, let us increase the temperature to that of the next isotherm 
on the diagram. It will be practically the same as the first, except 
for one important difference, namely, condensation begins later 
since the pressure of a saturated vapour is greater at a higher tempera- 
ture. Moreover, condensation is completed earlier since the piston 
does not reach its preceding position owing to the thermal expan- 
sion of the liquid. 

By increasing the temperature further, we obtain a series of iso- 
therms in which the horizontal segments corresponding to two-phase 
systems become shorter and shorter. Finally, this segment disap- 
pears entirely. The decrease in the length of the horizontal segment 
indicates that the specific volume of the liquid is approaching that 
of the vapour. At a certain critical temperature, these volumes be- 
come equal and the isotherm no longer has a horizontal portion. In 
the pressure-volume diagram, the critical point is easily determined 
as the apex of the dotted two-phase region. In a diagram of state, 
the critical point is located where the liquid-vapour phase equilib- 
rium curve breaks off. 

As the temperature is increased, the isotherms resemble broken 
curves less and less and gradually approach the hyperbolas of an 
ideal gas. 

The existence of a critical point indicates that we were justified 
in stating that there is no basic difference between a gas and a liquid. 
We see that if we by-pass the critical point a transition from a liquid 
state to a gas state can be achieved without going through a phase 
transformation. 
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We cannot obtain a liquid by compressing a substance the temper- 
ature of which is above the critical point. When such a substance is 
highly compressed, it becomes very dense, with its molecules coming 
into close. contact with one another. Nevertheless, a liquid in the 
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usual sense of the term cannot be obtained. The substance cannot be 
poured into a glass like an ordinary liquid, i.e., its state has no dis- 
tinctive form. This is due to the fact that the phase equilibrium 
curve was not crossed during compression. The absence of such a cross- 
ing indicates that a two-phase liquid-gas system cannot be obtained. 
This means that we cannot get a liquid with a definite form; instead, 
the liquid fills the entire volume available to it. 

Compression must take place at temperatures lying below the 
critical point if a gas is to be liquefied. This is easily achieved if the 
required temperatures are reached by thermal exchange with cold 
bodies. However, in the case of oxygen, nitrogen and hydrogen, 
this is not possible since the critical temperatures are very low. In 
order to liquefy these gases, we must resort to Joule-Thomson or 
adiabatic cooling. 

In the former case, the gas is compressed by means of a compressor 
and passed through a refrigerator. Then, the gas enters a spiral 
tube and is allowed to escape through an aperture, which serves as 
the porous plug (partition) in the Joule-Thomson experiment (see 
p. 175), to a region of lower pressure (atmospheric). Upon expanding, 
the gas cools, rises to the top, and cools the spiral tube. Thus, each 
successive portion of escaping gas will be colder than the preceding 
one. Finally, a temperature is reached at which the gas is trans- 
formed into liquid. 

The other method of liquefying gas involves the use of an ex- 
pander. In a reciprocating expander, a gas is expanded adiabatically, 
performing work in moving a piston, and leaves the cylinder at a low- 
er temperature. By having each portion of gas cool the subsequent 
one, one can reduce the temperature to —150°C. Further cooling is 
impeded by the absence of suitable lubricants to maintain the fric- 
tion between the piston and the cylinder walls at a low level. A solu- 
tion to this problem was found by P. L. Kapitsa, who developed 
a refrigerating turbine—a turboexpander. A turbine is rotated by 
means of the gas from a compressor. The gas expands adiabatically, 
is cooled, and cools the subsequent portion of gas. Difficulties of 
lubrication are overcome by placing the bearings, requiring lubri- 
cation, external to the cold region. 
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When it is said that a substance “vaporises”, the reference is 
usually made to the vaporisation of a liquid. The vaporisation of 
solids is called sublimation. One of the most familiar examples of 
evaporation of a solid is the sublimation of naphthalene.. 

Every odorous solid sublimates to a significant extent. The odour 
is produced by the molecules separating from the substance and 
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reaching our olfactory organs. Usually, however, a substance wil] 
sublimate to an insignificant extent. Sometimes, sublimation may 
not be detected even by very careful investigation. But, in prin- 
ciple, all solids, including iron and copper, vaporise. If sublimation 
is not detected it simply means that the density of the saturated 
vapour is extremely low. That this should be so is quite natural. The 
motion of atoms and molecules in a solid is very ordered and there 
is little probability that a molecule will separate from the surface 
of the solid. 

The density of a saturated vapour in equilibrium with a solid 
increases with increasing temperature. It can be shown that a number 
of substances having a strong odour at room temperature do not 
manifest it at a reduced temperature. In most cases, the density of 
the saturated vapour of a solid cannot be increased significantly 
simply because the substance melts first. 

Vapours are frequently used to obtain crystals when the latter are 
required in very pure form. This can be accomplished, for example, 
by precipitation on slightly cooled glass. 
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The transition from a liquid to a solid state (crystallisation) and 
the reverse transition (fusion) involve a fundamental rearrangement 
of particles. Upon fusion, long-range order in the arrangement of 
molecules or atoms disappears. 

For a given pressure, fusion occurs at a very definite temperature. 
The vibrations of molecules or atoms become so intense that the 
maintenance of long-range order becomes impossible. 

If the temperature is maintained at the fusion point, a liquid and 
a crystal may remain in a state of equilibrium, like in the case of 
a liquid and a saturated vapour. Crystals will neither grow nor melt. 

Iexternal pressure changes the fusion temperature. As a rule, the 
fusion temperature increases with pressure, i.e., fusion becomes 
more difficult. However, there are several exceptions to this rule. 
One of these is ice. The melting of ice is facilitated by an increase 
in pressure. In terms of a phase diagram, we can briefly describe the 
normal and anomalous behaviour of bodies as follows: Usually, 


a> 0, i.e., the equilibrium curve forms an acute angle with the 


‘ She 0 and the curve forms 
an obtuse angle with the abscissa. The strange behaviour of ice is 
related to another anomaly, namely, ice is lighter than water. The 
relationship of these two anomalies is shown in the following equa- 


: : 7 dp Q ? 
é w 5 ED = 
ion, which was derived above: a Tas The overwhelming 


temperature axis. In the anomalous case 
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majority of solids are denser than their liquids. Evidently, under 
such conditions, a pressure which produces packing should facili- 
tate fusion. The relationship between the two anomalies is quite 
natural. Consider a liquid and acrystalin a state of phase equilib- 
rium. Let us raise the pressure without changing the temperature. 
The atoms should approach one another. If the solid is denser, the 
liquid is transformed into a crystal state. If the liquid is denser, the 
reverse transition occurs. 

The anomalies of water play an extremely important role in our 
lives. If they did not exist, rivers would freeze at the bottom. The 
anomalies of ice and other such bodies are due to the structures of 
these bodies. Ice crystals do not conform to the law of compact 
packing of particles. Thus, a disturbance of long-range order results in 
an increase in density rather than a decrease, as is usually the case. 

Let us return to Fig. 264 (see p. 603). The broad ice channels may 
displace molecules of ice by expanding somewhat. When ice melts, 
molecules may “fall” into this channel. In such a case, of course, the 
density will increase. No theory exists by means of which the heat 
of melting or the melting temperature could be predicted. This is 
due to the dependence on a great many structural factors. To be 
sure, the heat of melting is easier to predict, since the melting 
temperature is equal to the heat of melting divided by the entropy 
of melting. 

A measure of the binding forces between molecules or atoms is, 
of course, the heat of sublimation (the energy required to break the 
intermolecular bonds) rather than the heat of melting (the en- 
ergy required to eliminate long-range order). 

As indicated above, fusion of crystals cannot be retarded. On the 
other hand, crystallisation can be retarded and, in fact, sometimes 
will not occur at all. In order for crystallisation to begin in a liq- 
uid, there must appear a nucleus, i.e., a system consisting of several 
scores of atoms or molecules which have assumed an arrangement 
corresponding to that of a crystal of the substance. Moreover, condi- 
tions in the liquid must be favourable for the growth of this nucleus. 
In most liquids, it is difficult to achieve a significant retardation in 
the process of formation of nuclei. Cooling under strict conditions 
is required to achieve such retardation. Dust particles must be pre- 
vented from falling into the liquid and all mechanical disturbances 
such as vibrations and the jarring of the vessel containing the liquid 
must be avoided. 

At sufficiently great supersaturation, it is probably impossible to 
avoid the spontaneous formation of nuclei, i.e.,-to avoid the stabi- 
lisation requisite for the crystallisation of atomic or molecular 
groups. But something else may occur. When the temperature is de- 
creased, the mobility of the particles may decrease to such an extent 
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that the rate of growth of crystalline nuclei approaches zero. That 

is how glass is formed. Crystals will invariably grow when a few. 
crystal particles (seeds) are to be found in a liquid under conditions 

of thermodynamic equilibrium with a crystal phase. Crystals for 

industrial purposes are grown by means of seeding. 

If heat is removed very slowly, i.e., the temperature decreases 

a fraction of a degree per day, and if the crystalline particle turns in 

the liquid, the crystal grows only on a few of the faces possible. The 

growth occurs, of course, on those faces having the least surface ener- 


gy. The indexes of such faces are always prime numbers. Most. fre- 
quently, faces with the highest surface density of atoms or mole- 
cules are formed. It is difficult to determine beforehand which faces 
will grow, especially since this depends on numerous subsidiary 
circumstances. Moreover, we shall not consider the growth of crys- 
tals from solutions having certain peculiar features. However, it 
may be asserted that a crystal in equilibrium with a melt (or a solu- 
tion) will assume a form such that its surface energy is a minimum. 

The mechanism of crystal growth consists in each successive par- 
ticle attaching itself to the crystal at a point where the binding 
forces are a maximum and, therefore, the potential energy a mini- 
mum. Fig. 282 shows three possible phases at which an atom can 
become attached to a growing crystal. At A the attracting forces 
acting on an atom are greater than at B, and at B greater than at C. 
Thus, a molecule or atom will invariably attach itself more easily 
i: a layer already partially formed rather than begin to form a new 
ayer. 
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According to calculations, in certain cases the initial formation 
-of a new layer is bound up with such difficulties that the very growth 
of a crystal becomes incomprehensible. In such a case, the spiral 
mechanism of explaining growth is most applicable. It is evident 
from Fig. 272 that spiral growth can continue indefinitely and new 
atoms and molecules continually become attached to points which 


Fig. 283 


are favourable from the energy viewpoint. Thus, it is not necessary 
for growth to take place through the formation of a new layer. The 
beginning of spiral growth takes place with the formation of a fault 
known as a spiral dislocation. Such a “fault” is usually due to a tiny 
foreign inclusion. A portion of the surface of a crystal which has 
developed spirally is shown in Fig. 283. 
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A transformation in the solid phase consists in a transition from 
one long-range order to another. The mechanism of such transfor- 
mations is of great interest. 

The simplest picture for the transformation of one solid phase into 
another in the case of simple substances is obtained when the struc- 
ture of both phases constitute compact packings of spheres. Thus, 
cobalt and thalium are encountered in the form of cubic as well as 
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hexagonal packings. By shifting a layer, we can transfer it from an 
“hexagonal” to a “cubic” state, and vice versa. 

It even has been possible to grow a single crystal of a hexagonal 
phase from a single crystal of a cubic phase by means of transforma- 
lions of this kind. Usually, this is not possible since the growth of 

_ crystals of a new phase begins simullanecously from many centres 
and a monocrystal becomes transformed into a fine crystalline sub- 
stance. In most cases, a crystal crumbles when il is transformed into 
another solid phase. Sometimes the 
outer “shell” of a polyhedral mono- 
crystal is preserved and a fine crys- 
talline substance occupies this perfectly 
symmetrical volume. 

The reason for the difficulty is 
clear. Crystals of a new phase may 
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begin to grow from various points. But close layers in a cubic face- 
centred lattice may be formed by four different systems. Let us 
return to Fig. 264 (p. 600). In the crystal shown in this figure, close 
planes are perpendicular to the spatial diagonals, of which there are 
four in a cube (eight corners). Thus, hexagonal crystals of four differ- 
ent orientations may grow from a crystal with a cubic packing arran- 
gement. G. V. Kurdyumov’s work, devoted to transformations of 
iron and steel, laid the basis for the study of the rearrangement of 
atoms in phase transformations. At high temperatures, iron exists 
in the form of a compact cubic packing of atoms. At low tempera- 
tures, the iron atoms become arranged in a body-centred lattice. 
This transformation, known as a Martin transformation, is of tre- 
mendous importance in metallurgy* and, therefore, should be con- 
Sidered in greater detail. 

In Fig, 284, we see what occurs when the temperature is increased. 
The left diagram again shows a compact cubic packing arrangement; 


_e-————_____ 
* The hardening of steel is nothing more than a Martin transformation. 
41-1409 
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the right diagram shows a body-centred packing arrangement drawn 
in a rather unusual form: we see the projection of an arrangement of 
atoms as it appears when viewed along a plane diagonal of a cube. 
It would seem that these two diagrams have little in common, the 
main difference being that the left diagram represents a three-slo- 
reyed structure and the right a two-storeyed structure (the trian- 
gles are in the second storey). Less important is the difference in the 
angles of the rhombuses (not shown in the figure). As the temperature 
is increased, the atomic vibrations increase and the less compact 

- body-centred lattice becomes less advantageous at a temperature 
of 906°C. The two-storeyed structure becomes transformed into the 
three-storeyed one by the alternate shifting of the layers marked by 
triangles. For example, the odd layers shift to the left and the even 
ones to the right. This shift occurs along the diagonal of a rhombus; 
the angle of the rhombus changes at the same lime. 

When a phase transformation occurs in iron, crystal particles of 
the new phase may become oriented in any of 24 different directions. 
The number 24 is obtained in the following manner. There are four 
close planes in a cubic face-centred crystal and, as can be easily 
shown, a crystal of the new phase grows in six different directions 
in a close layer. 

Undoubtedly ordered, regular processes play an important role in 
the transition from one order to another. In such a rearrangement 
of order, atoms do not have to interchange places, i.c.., only an organ- 
ised shifting of atoms occurs. This is what takes place in a Martin 
transformation, a transformation which does not involve diffusion. 
However, in other transformations in a solid, diffusion phenomena 
may play an important role. 
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It has long been known that foreign atoms diffuse in a solid. The 
surface layer of steel may be impregnated with carbon (cementation), 
nitrogen or boron. Diffusion occurs to a great depth and it is not par- 
ticularly difficult to follow the process. At a temperature of 200-300°C, 
significant quantities of silver penetrate lead to a depth of several 
centimetres in an hour. 

However, not only foreign atoms migrate in a crystal. An iron alom 
can migrale in a crystal of iron and a copper alom in a crystal of 
copper. If a piece of radioactive copper is pressed against an ordi- 
nary piece of copper, the latter will soon become “contaminated” 
(radioactive). By means of tagged atoms, one can study the diffusion 
of atoms of the same kind, as well as of “foreign” atoms. 

Diffusion is possible as a result of thermal vibrations. When an 
atom leaves ils equilibrium position, a neighbour takes its place. 
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Upon returning, the atom occupies the vacated spot. Thus, atoms 
interchange positions. Such an interchange is, of course, not easily 
achieved if only two atoms participate in the process. When two 
atoms interchange posilions in a solid, a whole group of atoms are 
involved. An atom slips forward only when the thermal vibrations of 
many atoms accidentally create favourable conditions for this. 

Dislocations, emply spaces and fractures, which always exist in 
a crystal, play an important role in diffusion. The presence of an 
emply space in a crystal facilitates the step-by-step migration of 
an atom through the lattice. An atom hindering this migration is 
“pushed” into the empty space. 

If a foreign atom is not very large, it may move through the lat- 
tice without interchanging positions with lattice atoms. When the 
conditions become favourable, such an atom slides from one empty 
space in the compact packing of spheres to the next. 

Diffusion is a two-way effect. If a zinc plate is pressed against 
a copper one, zinc atoms will penetrate the copper and copper atoms. 
will penetrate the zinc. To be sure, the rates of flow in the two direc- 
tions may vary considerably. 

The diffusion of atoms through a crystal depends on many factors. 
It is interesting that a diffusion process proceeds most rapidly 
when the foreign atoms differ in all respects from the atcms of the 
crystal through which they move. Diffusion proceeds mcst slowly 
for atoms which are the same as those of the crystal or in the same 
column of the Mendeleyev periodic table as those of the crystal. 

As already indicated, the presence of fractures and dislocations 
facilitate diffusion. Therefore, diffusion proceeds most rapidly in 
a deformed metal. The rate of diffusion is greatly dependent on tem- 
perature. This is not surprising since the diffusion coefficient, the 
coefficient of proportionality between flow of matter and concentra- 
tion gradient, can always be represented by an expression of the form 


Ae-U/kT 


where U is the height of the potential barrier which an atom must 
surmount in an elementary diffusion event. The existence of such 
a relationship is rather evident since the diffusion coefficient must 
be proportional to the number of atoms the energy of which suffices 
to cross the potential barrier. 

Such barriers are quite high. In the case of self-diffusion, they are 
usually about 1-2 ev. It will be recalled that kT at room temperature 
is equal to ~0.03 ev. The number of atoms with energies much 
greater than the average is very small; hence, practically no diffusion. 
occurs. At a temperature of the order of 1,000°C, the situation is 
entirely different. 


4i* 
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In the preceding article, we discussed crystal-crystal transforma- 
tions occurring in an organised manner without diffusion. It should 
not be assumed that all phase transitions occur this way. On the 
contrary, at a sufficiently high temperature, the interchange of posi- 
tions by atoms begins to play a very important role and the organ- 
ised nature of transitions will be achieved in only small regions, or 
may even be completely obscured by the interchange of positions 
by atoms. 

If a solid state transformation is of a diffusive nature, il proceeds 
at a rate that is comparable to that of self-diffusion processes. The 
heights of potential barriers surmounted by atoms during rearrange- 
ment are of the same order as during self-diffusion processes. 

In the case of organised displacement of atoms of the Martin 
transformation type, the transformation proceeds al a_ ten-fold 
rate at low temperatures. 


CHAPTER XXXIV 
DEFORMATIONS OF BODIES 


253. Elastic Properties 


For every solid, there exists a distorting force limit up to which 
a deformation is elastic. This means that if the elastic limit is not 
exceeded, the body returns to ils original state. 

Ilastic deformations, like other deformations, are associated with 
the displacement of atoms (or molecules). When a body is extended 
elastically the interatomic spacing increases and when it is com- 
pressed the interatomic spacing decreases. 

The distinctive feature of elastic deformations is that they do 
not destroy interatomic bonds or create new ones. 

When a crystal is deformed elastically, all of the atoms continue 
to have the same neighbours. Thus, in elastic displacement, the lat- 
tice of a crystal as a whole becomes deformed (sloped). Hence, each 
atom continues to have the same neighbours. This enables the body 
to return toils equilibrium state when the distorting force is removed. 

The change in interatomic spacing that may be achieved by means 
of elastic extension or compression is quite small. The maximum rel- 
ative elongation of an elastic nature does not, as a rule, exceed Q.001. 
This means that for interatomic spacings of the order of 2A the 
equilibrium, positions of the atoms may be displaced by no more 
than 0.002 A. Such small changes in a lattice period can be detected 
through X-ray analysis by observing the displacement of diffraction 
lines on a roentgenogram. This requires that the lines be filmed at 
large 0 angles, since only in this manner can small changes in inter- 
planar spacings be detected (see p. 391). 

The elastic deformation of polymers such as rubber is of an entire- 
ly different nature. Rubber has mechanical properties which basi- 
cally differ from those of crystalline substances. The fundamental 
difference lies in the magnitude of elastic elongation. Certain kinds 
of rubber may be stretched to 10-15 times their normal length without 
exceeding the elastic limit. Thus, they may be elongated 10,000 
times more than metals! The magnitude of the modulus of elasticity 
of rubber is no less striking. 

A steel wire having a cross-section of 1 mm? will stretch one twen- 
ty-thousandth of its length under the action of a load of 1 kg, but 
a rubber band having the same cross-section will stretch to twice 
its original length under the action of such a load. 


646 Deformations of Bodies 


Two processes occur when polymers are stretched. First, tangled 
bundles of molecules become disentangled. At the same time, there 
occurs regular packing of certain portions of the disentangled bun- 
dles of molecular chains into a three-dimensional order. Evidently, 
the fact that crystallisation takes place upon stretching is of second- 
ary importance, since the established order does not remain when 
the external force is removed, i.e., the bundles of molecules become 
twisted once again. 

The twisting of bundles of molecules is accompanied by an increase 
in entropy, i.e., an increase in the degree of disorder. It turns 
out that the internal energy of rubber and similar polymers practi- 
cally does not change when such substances are elastically deformed. 
Therefore, the work of stretching, which according to the fundamen- 
tal laws of thermodynamics is dA = dU — TdS, in this case simply 
equals —TdS, i.e., it is directly proportional to temperature. (It 
should be recalled that the work of external forces on a system is 
considered to be negative.) In this respect, the elastic deformation 
of rubber is of the same nature as the isothermal compression of 
a gas (cf. p. 178). 

It should be noted that both the elastic deformation of a crystal 
and that of rubber do not yield a new potential energy minimum. 
In the case of a crystal, this is due to the fact that the energy does 
not change at all. 


254. Plastic Properties 


Slippage. The elastic deformation of a crystal consists in the chang- 
ing of its interatomic spacing with each atom maintaining ils same 
neighbours. On the other hand, in the case of a plastic deforma- 
tion—a deformation which remains when the external force pro- 
ducing it is removed—atoms surmount their potential barriers and 
enter new “potential wells”, i.e., change their neighbours. The basic 
mechanism of plastic deformation is the slippage of one atomic plane 
relative to another. An element of such slippage consists in the dis- 
placement of all of the atoms by one period. This can be detected 
with the naked eye in the form of so-called slip bands. Slippage 
occurs at the weakest points (fractures and other defects) and the 
crystal breaks up into layers (slip stacks). The plotting of stack 
thicknesses, the order of magnitude of which is equal to several 
tenths of a micron, yields a random distribution curve. The forces 
required to displace atomic planes having different indexes will 
differ. Usually, it is easiest to displace the planes which are most 
compactly filled with atoms. However, slip planes of crystals may 
change with changes in temperature and impurities and also during 
the deformation process itself. In aluminium, the plane (111) is 
a slip plane. 
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Slippage occurs along a given plane having a definite orientation. 
Usually this plane has the densest distribution of atoms (e.g., [1014] 
in an all-sided face-centred cubic lattice). 

In order for displacement to begin, a certain minimum stress 
(ultimate shearing stress) is required. The magnitude of this stress 
is very small, reaching several grams per square millimetre in some 
cases. In measuring the ultimate stress, one must, of course, take into 
account the orientation of the slip plane relative to the external 
force. 

Strength. A monocrystal of zinc can be easily bent by hand. How- 
ever, il is not possible to straighten it in the same manner. This 
is due to the fact that its strength has increased. 

The resistance of a crystal to displacement increases with increas- 
ing deformation. Therefore, plastic displacements along a given slip 
plane do not cause the material to rupture, but rather cease when the 
strength is sufficient to oppose the external force, and thereupon dis- 
placement begins in other planes. Thus, the number of slip bands 
increases and the slip stack thicknesses decrease. 

If an external force is applied to a crystal previously subjected 
to plastic deformation, such deformation will resume, of course, when 
the magnitude of the force reaches the value at which it previously 
ceased to be effective owing to an increase in strength. It can be stat- 
ed, therefore, that an increase in the strength of a crystal increases 
its elastic limit—and, moreover, by a large factor. 

One theory relates the described increase in the strength of a crys- 
tal to a disturbance of the regularity, i.e., distortion, of its lattice. 
From this viewpoint, it is quite natural that the strength of a crystal 
should increase with increasing rate of deformation and decrease 
with increasing temperature. However, from the viewpoint of the 
dislocation theory discussed above, the picture is different. 

Plastic Deformation as a Displacement of Dislocations. Let us 
consider in greater detail the process of displacing one atomic plane 
relative to another. If there are no dislocations in a slip band, it is 
necessary to shift every row of atoms in the displacement plane. The 
situation is quite different when a displacing force acts on a crystal 
containing dislocations. 

Fig. 285 shows a compact packing of spheres which contains 
a simple dislocation (only the end spheres of the rows are shown). 
For simplicity, let us assume that the dislocation region embraces 
a minimum number of rows. Then, the dislocation consists basically 
in the following: Between two rows of the upper, extended layer, 
adjoining the boundary between blocks, there is a linear gap. In the 
lower, compressed layer on the other side of the boundary between. 
blocks, there is an extra row of atoms (the two rows of atoms just 
below the linear gap are very compressed). Now, let us begin to dis- 
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place the upper block to the right relative to the lower one. At 
a certain initial instant, the “fissure” is between rows 2 and 3; rows 
2’ and 3’ are compressed. As soon as the force becomes effective. 
row 2 moves into the “fissure”, the shape of sphere 3’ is restored, 
and sphere /’ becomes compressed. The entire dislocation has 
shifted to the left, and it will continue to move in this direction 
until it is “pushed out” of the crystal. In other words, displacement 


Fig. 285 


consists in shifting the dislocation line along the displacement plane. 
It is clear that a much smaller force is required to achieve dis- 
placement when dislocations are present. 

According to calculations, the strength of a crystal in which there 
are no dislocations should he a hundred times as great as the strength 
of an actual crystal, determined experimentally. The presence of 
a small number of dislocations suffices to decrease the strength to 
a small fraction of that of an ideal crystal. 

Fig. 285 shows how a dislocation is “pushed out” of a crystal by 
an applied force. Thus, as the degree of deformation is increased, the 
strength of a crystal increases. When the last dislocation of a crystal 
is eliminated, its strength will be about a hundred times as great 
as that of a perfectly normal crystal. In this manner, an increase in 
strength can be easily explained. To be sure, in order Lo obtain quan- 
titative agreement between calculations and experimental results, we 
must assume that special as well as ordinary dislocations may aid 
displacement. 

Excellent confirmation of this theory is provided by the fact that 
the strength of perfect crystals which are grown artificially is approx- 
ximately equal to the calculated value for an ideal crystal. 
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255. Ultimate Strength 


As the stress within a body increases, its deformation increases 
up lo a certain point (the elastic limit). Then, plastic deformation 
begins and because of an increase in strength the curve rises sharp- 
ly. Finally, the rupture point is reached. 

The ultimate strength of a body depends, to a certain extent, on 
the duration of an applied force. When the force is of prolonged dura; 
lion, the ultimate strength drops. The values given in engineering 
handbooks generally refer to short-duration tests. The dependence 
of the ullimate strength of a body on the duration of an applied 
force indicates that in slow processes the behaviour of a solid depends 
on internal diffusion processes. It may be assumed, for example, 
that even small forces can create a favourable trend for processes 
involving an interchange of positions by atoms. 

The value calculated for the ultimate strength of a perfect crys- 
tal is several hundred times as great as measured values. This sig- 
nifies Unal faults play a fundamental role in a crystal. The calculat- 
ed value for the rupture strength of a monocrystal of rock salt is 
200 kg/mm?*. Under ordinary conditions, a rod of this substance will 
rupture when it is subjected to a load of 0.5 kg/mm. 

The role played by fractures was demonstrated in the well-known 
experiments of A. F. Yoffe, who investigated the rupture of rods of 
rock salt in water. Water dissolves the surface of such a rod and 
“heals” the microfractures formed during extension. As a result, 
the resistance of rock salt to rupture becomes greater than 100 kg/mm?, 
i.e., it approaches the theoretical value. 

The presence of faults serves to decrease the effective area at which 
rupture occurs. In the final analysis, the force acting at the instant 
of rupture is determined by the number of broken interatomic bonds. 

The strength of various materials determined in this manner vary 
within narrow limits. Thus, a piece of thread or rubber band rup- 
tures at astress of the same order of magnitude as that of a steel wire. 


256. Mechanical Properties of a Polycrystalline Material 


During the initial, elastic stage of deformation, the grains of 
a crystalline material are not deformed uniformly since they have 
various orientations relative to the line of action of the force. As 
a result, the elastic properties of a polycrystalline substance will 
differ from those of a monocrystal. 

However, in the plastic region, the behaviour of a polycrystalline 
substance may differ from the behaviour of a monocrystal of the same 
substance to an even greater extent. A polycrystalline substance 
offers greater resisLance to an external force. This is not surprising 
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since the development of plastic displacements in a given grain will 
be impeded by neighbouring grains in which slip planes are orient- 
ed entirely differently relative to the applied force. 

Moreover, in a polycrystalline material, there occurs an entirely 
new phenomenon: the turning of grains and the formation of tex- 
ture. The turning of grains in drawing, rolling and other deformation 
processes is determined by the tendency of each grain to become 
aligned in such a manner that slippage is facilitated, i.e., with its 
slip plane parallel to the applied force. If there are several slip planes, 
the grain assumes a position for which the effect of the slip planes 
is maximal. For example, in most metals having cubic all-sided face- 
centred cells, the grains tend to become arranged with the [141] 
directions parallel to the axis along which the material is drawn. 

Another new phenomenon occurring in such a material consists 
in the slippage of the grains relative to one another along an inter- 
crystal layer. Such displacement differs from crystal displacement 
and is more like viscous flow in thick liquids. 

A polycrystalline material may not rupture at the same value of 
stress as a monocrystal. In certain cases, the grains of a polycrystal- 
line material remain whole when the material ruptures. This occurs 
when intercrystal layers have weak mechanical properties. Asa gener- 


al rule, the strength of a material increases with decreasing grain 
size. 


257. The Effect of Surface-Active Substances on Deformation 


Every solid has numerous ultramicroscopic structural faults, 
arising as the result of the thermal mobility of ils atoms and the 
presence of impurities and mechanical defects. These faults are dis- 
tributed throughout the volume of a solid and in many cases can be 
viewed as extremely minute embryonic microfissures. The basic 
characteristic of such embryonic microfissures is their ability to in- 
crease in size during the process of deformation of the material. 

Under the action of external forces, the size of these microfissures 
increases and a concentration of stress arises at their edges. This, 
in turn, further facilitates the growth of the microfissures. In the 
case of a brittle material, such an increase in the size of microfissures 
during a deformation process may result in premature rupture of 
the material. In the case of a plastic material (most metals) such an 
increase in the size of microfissures during a deformation process 
results in the formation of plastic displacements. When a body is 
in a three-dimensional stressed state, the microfissures have wedge- 
shaped cross-sections and are characterised by exposed surfaces— 
orifices and cul-de-sacs in which the fissures preserve their embry- 
onic nature. Actual fissures terminate in cul-de-sacs like a sharp 
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blade having a very large curvature, with a radius of curvature of 
the order of magnitude of the lattice spacing. When the deforming 
forces are removed—in the range of elastic deformations—the micro- 
fissures gradually “heal” in reverse order, i.e., first the cul-de-sacs 
close and then the orifices. 

P. A. Rebinder has shown that the effect of the surrounding medi- 
um of a solid on its mechanical properties is not restricted to such 
chemical action as corrosion and dissolution. The exposed surface of 
a solid is always coated with a thin film of a component of the sur- 
rounding medium which is most closely related to the given solid. 
Such a substance may be a gas or vapour usually contained in air, or 
a substance located in the vicinity of the solid. The molecules of 
a substance clinging to the surface of a solid, or, as we generally 
say, adsorbed by a solid, are able to move along this surface and 
migrate from a region where there is an excess of such molecules to 
a region where there is a deficiency for complete coating of the sur- 
face. The tendency of an adsorbed layer to occupy all of the surface 
available to it is due to the fact that adsorption decreases the surface 
energy of a solid. Substances which may be adsorbed by the surface 
of a solid are called surface-active substances. Various organic alco- 
hols, acids, and salts of these acids, i.e., soaps, are highly surface- 
active substances with respect to metals. 

The strength of a solid decreases when it adsorbs a surface-active 
substance. If a solid is ruptured in a medium containing even a small 
amount of a surface-acltive substance (for example, in a solution of 
oleinic acid in pure vaseline oil), the force required to rupture the 
material is less than under ordinary conditions. This is particularly - 
evident in the crushing of soft rocks and in the subjection of metals 
to a variable force or a force of prolonged duration. 

The effect of adsorbed molecules on the strength of a solid can be 
explained as follows. When molecules are adsorbed by the surface of 
a body, they penetrate the microfissures as a consequence of their 
mobility and tendency to occupy all of the exposed surface of the 
adsorbent. The drawing of adsorbed layers into a microfissure is due 
Lo the decrease in the surface energy of a solid caused by such penetra- 
tion. If an obstacle is placed in the path of an adsorption layer tend- 
ing to occupy a surface area of a solid which is not yet occupied, 
the adsorption layer will exert pressure on the obstacle. Within 
a microfissure, such an obstacle is provided by the molecules them- 
selves, i.e., their size prevents them from penetrating deeper into 
the microfissure. Therefore, at the boundary of an adsorbed layer 
within a microfissure, a,pressure, which is directed so as to increase 
the size of the fissure in depth, arises. Adsorbed layers behave like 
wedges driven into the microfissures. Thus the penetration of adsorb- 
ing molecules into the orifices of microfissures tends to create 
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additional disrupting forces. This is equivalent lo increasing the 
external deforming forces. Therefore, the rupture of a solid in the 
presence of adsorbing substances is brought about by lower ap- 
plied forces. 


258. Material Breakdown Under the Action of a Stream 
of Particles 


The problem of material breakdown under the action of a stream 
of particles is of great importance in the construction of nuclear 
reactors. The materials of a reactor, including nuclear fuel, modera- 
tor, walls and instruments, are subjected to the action of neutrons, 
fission fragments, electrons, etc. Let us consider those actions of 
streams of particles which leave permanent effects. 

In the first place, we should mention particle collisions in which 
an electron providing a chemical bond between atoms is dislodged 
from its position. In such cases, ionisation results in the rupture 
of the bond. This bond is not necessarily re-established. Moreover, 
the ions or radicals which are formed may recombine in a different 
way. Therefore, in molecular materials, ionisation results in the 
breakdown of certain molecules and the creation of new ones. 

An atomic nucleus may also be displaced from. its position. In 
such a case, it drags along its electronic cloud. Therefore, it can be 
said that an entire atom, rather than just a-nucleus, has been dis- 
lodged from its position. Such an effect of the action of radiation is 
almost always irreversible. 

Materials are damaged by radiation as a result of the displacement 
of atoms from their positions and the rupture of chemical bonds. 
Atoms are dislodged under the action of heavy charged particles and 
fast neutrons. Chemical bonds are ruptured under the action of slow 
neutrons, y-rays and electrons. 

Let us consider more carefully what occurs when atoms are dis- 
lodged from their positions in solids. The process of displacement 
of atoms is a chain process. This means that the first displaced atom 
displaces another atom which is located in its path; the latter is 
able to displace a third atom, etc. By means of such a chain process, 
a single fast nuclear projectile is able to produce considerable distor- 
tion in the crystal lattice of a solid. The nature of the distortion 
varies considerably from case to case. A crystal lattice may be com- 
pletely destroyed. Foreign atoms may penetrale between atoms of 
the primary lattice. Also possible are processes involving the sub- 
stitution of atoms of the primary lattice by projectile atoms. The 
number of displacements per charged particle for one element does 
not differ greatly from the number for another element. An q-parti- 
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cle having an energy of 5 Mev, or a proton having an energy of 
20) Mev, dislodges 60-80 particles. 

These figures suggest that the heavier the particle, the greater the 
damage. Analogous figures for fission fragments of uranium-235 
or plutonium-239 nuclei are considerably more unexpected. Such 
a pair of fragments produces, for example, 25,000 displaced atoms in 
uranium and 8,300 displaced atoms in graphite. 

The process of slowing down a neutron from its initial velocity 
to thermal velocity also does not proceed without causing damage 
to materials. The slowing down of a neutron causes 450 atoms to be 
displaced in beryllium, 1,870 atoms in graphite and 6,030 atoms 
in aluminium. 

It is evident from these figures that significant changes in the prop- 
erties of a crystal lattice are to be expected as a result of the dis- 
placement of atoms from their positions in the lattice. Metals are of 
primary interest in ‘this connection. The reason for this is that the 
sole permanent effect of radiation in metals is the displacement 
of atoms. 

The action of neutrons and fission fragments has been studied most 
carefully. This is not surprising since such investigations are of basic 
importance in the design of nuclear reactors. The effect of a dose 
of 10! neutrons per sq cm has been studied in detail. Such a dose is 
not very large. Generally speaking, the materials of a nuclear reactor 
are subjected to such a stream of neutrons during each day of opera- 
tion. However, even in the case of such a small dose, the properties 
of metals undergo important changes. These changes approximate 
those which occur in the cold working of metal. Thus, under the 
action of neutrons and fission fragments a metal’s brittleness and 
hardness increase, its ductility decreases, and its electromagnetic 
properties also change. 

Radiation damage in metals consists mainly in the displacement 
of atoms from their positions, but in organic materials, where the 
atoms are connected by chemical bonds, the changes consist mainly 
in the rupture of such bonds as the result of ionisation. Organic mate- 
rials are very rapidly broken down under -the action of radiation. 
A dose of the order of 10** neutrons per square centimetre practically 
disintegrates an organic substance. In a reactor, paraffin, olefin and 
polyphenyl sustain a damage of 25 per cent within several hours. 

The transformation of an organic material usually consists in the 
liberation of a gas and in polymerisation. However, it should be noted 
that in certain cases high-polymer materials are depolymerised 
under the action of radiation. 


CHAPTER XXXV 


DIELECTRICS 


259. The Relationship Between Permittivity 
and the Polarisability of a Molecule 


In a number of cases, particularly in gases, the molecules of 
a substance do not interact with one another. Hence, the electrical 
properties of such a substance are determined by the average behav- 
iour of one of its molecules. Molecules do not interact with one another 
in many dilute solutions as well. Occasionally, molecular inter- 
action plays a secondary role even in a concentrated solution. 

Therefore, consideration of the electrical properties of a substance 
which consists of a large number of noninteracting molecules is of 
considerable importance. . 

The dipole moment of a unit volume of dielectric, P, is determined - 


by the permittivity © and the field intensity E in accordance with 
the formula . 


e—1 
P= iE 
(see p. 256). On the other hand, the polarisation vector P is equal to 
the sum of the dipole moments in a unit volume of dielectric: 


P=)\p 
or 


P=Np, 


where N is the number of molecules in such a unit volume and p 
is the “contribution” of each molecule to the polarisation vector. 
If E’ is the field intensity which acts on a molecule, then 


p= BE’, 
where B is the polarisability of the molecule. 

It would seem that the relationship between f and e€ should now 
be given by the expression ¢ = 1 + 4nNB. However, this is not so, 
and that is why the field intensity in the above formula was denoted 
by E prime. The equations relating P and E and p and E’ involve 
different field intensities. EZ is the force acting on a unit test charge; 
such a charge does not distort the existing field. Z’ is the field pro- 
duced by all the molecules (with the exception of the given one); 
acting upon the given molecule. 
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On p. 261, it was indicated that the field inside a dielectric sphere, 
E;, is related to the external field in which this sphere is located 
as follows: 


4 
E, =Ee— aP. 


It is evident that the field in a spherical cavity which is cut in a di- 
electric may be determined by changing the sign of P: 


4 
E; = E+ uP. 


{t may be rigorously proved that EZ’, the field due to all the mole- 
cules of a gas (except the one acted upon), is equivalent to the field 
in a spherical cavity. Thus, 


E' =E+SaP. 


The relationship between B and e may now be determined. Equat- 
ing the derived expressions for P, we obtain 


o—! E=Np=NBE’. 


4 


com E, we obtain the so-called 


eas Sei 4 e— 

Now, substituting 2’ = E+. +n is 
Clausius-Mossotti formula: 

e—1 4M ay 

ae a 


If each member of the equation is multiplied by > where M is 
the molecular weight and p is the density, the resulting expression 
will depend only on the polarisability B. Thus, NZ=Nay = 


= 6.02 x 1075 (Avogadro’s number). 
The quantity 


e—1l M 40 
F Say Se Nab 


is called the molecular polarisation. To determine the molecular po- 
larisation, one must first measure the permittivity of a substance as 
the ratio of the capacitance of a condenser filled with the substance 
under investigation to the capacitance of the condenser with the 
dielectric removed. Capacitance is usually measured by means 
of a bridge. Such bridges are constructed for the range 30 cps to 
300,000 cps. However, bridges may be constructed for frequencies 
up to 40 mc/s. 
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A wide variety of permittivity meters (instruments for measur- 
ing e) are employed. Such instruments are very sensilive and enable 
us to measure permittivity with a high degree of accuracy. Indeed, 
excellent results may even be obtained with gases having a pressure 
of the order of 1 mm of Hg. 


Using the formula ¢ = n? (see p. 331), we can derive an index of 
refraction equation which is analogous to the molecular polarisa- 
tion equation: 


_ ne—l M 


AN , 
= nm? 3 on = zT N avp. 
This characteristic of a molecule is called molecular refraction. 
Measured values of R and & for different frequencies of electromag- 
netic vibrations may differ considerably from one another. 
Despite the fact that the derivation of these formulas presupposes 
a gas, molecular interaction evidently changes matters little. In 


any case, the formulas for R and #are widely used in the investiga- 
tion of dilute solutions as well. 


Examples. Let us consider benzene, Clg (¢ = 2.28; p=0.88 gm/cem3; 
M 78), and water (¢ =81; p=1 gm/cm’; M=18). Assume that the plates 
of a flat condenser, which creates an electric ficld = 300 v/em=1 Gaus- 
sian unit, are immersed in these liquids. 


1. Let us calculate the polarisation (the electric moment of a unit 
volume of dielectric) of benzene and water: 


e—1 .23 — . 
Poencene =~ yee EB := ap Xb Gaussian unit; 
Pwater = 6.4 Gaussian units. 


Pees = P 
The contribution of each molecule to the polarisation vector is P=W° 


Nayd . 7 
where .V= ae is the number of molecules per unit volume; 


Poenzene = 1.5 10-23 Gaussian unit; 

Pwater =19.4X10-23 Gaussian unit. 

2. Let us calculate £’, the intensity of the electric field due to all 
molecules (except the one acted upon): 


4 : F 
Evenzene =E+ J Phenzene = 1.43 Gaussian units; 
E 
i.e., the field in water is about 28 times (!) as great as the applied field. 


Now, the polarisability of molecules of benzene and water can be deter- 
mined: 


water = 27-8 Gaussian units. 


Boenzene = ir -=1.05 10-23; Buater = 0.7 x 10728, 
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3. From measurements of ¢ by means of a permittivity meter, we can 
e—il M | 
e+2 p 


calculate the molecular polarisation # = 


Fovenzene :26.6 Gaussian units; 
P water — 17.3 Gaussian unils. 

4. Measurements of the index of refraction x by means of a_refracto- 
meter yield nyenzene =1-5014 and rweter = 1.330. Using these values, we can 
2 n°—1 M 
calculate the molecular refraction R -=+=————: 
n2t2 06 

Roenzene = 26.1 Gaussian units; 

Rwater =3-6 Gaussian units. 
IL is seen that in the case of benzene # = R and in the case of water the 
values of # and & differ considerably. The reason for this will be ex- 
plained in the next article. 


260. Polarisation of Polar and Nonpolar Molecules 


Polarisation of a substance under the action of an electric field 
may occur for two reasons. First, the centre of gravity of the electron- 
ic cloud may be displaced (inherent polarisability). Secondly, the 
field has an orienting action which may turn molecules having 
a constant, or rigid, dipole moment closer to the direction of the 
field. Therefore, it is customary to divide polarisability into two 
parts: a—inherent polarisability and b—orientation polarisability. 

A molecule must be turned as a whole in order for the dipole to 
become oriented. Owing to the inertia of a molecule, such turning 
requires a certain amount of time. For rapid electromagnetic vibra- 
tions, a rigid dipole cannot follow the field. Therefore, in the case 
of light waves, the orientation polarisability b is absent. 

Thus, 


f == “* Nay (a+b) and R= “= Napa. 


The polarisability a of a molecule can be determined by measuring 
the index of refraction. If, in addition, & is also measured, the orien- 
tation polarisability b is obtained by subtraction. 

The magnitude of the orientation polarisability is directly related 
to the rigid dipole moment p of a molecule. We shall show that 


motion. In the absence of a field, the assumption of any direction by 
the dipole moment p of a molecule is equally probable. The situa- 
tion changes if a field # isapplied. The potential energy of a dipole 
is equal to e (p.1 — g_), where pm; and g-_ are the potentials of the 
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field at the ends of the dipole, i.e., 


—eP i= — pE = — pF cos 0, 
where 0 is the angle between the field vectors and the dipole moment. 
A dipole oriented in the direction of the field has a minimum cnergy. 
This energy is equal to—p£. Thermal motion prevents all dipoles 
from assuming a position of minimum energy. A certain compromise 
distribution is established between the tendency to maximum entro- 
py and the tendency to minimum energy (see p. 629). The Boltzmann 
law is an expression of this compromise. The probability that the 
sneha a molecule lies between U and U + dU is proportional 
to eT dU. In our case, U =—pE cos 0. Therefore, dU=pE sinOd0. 
The fraction: of molecules the dipole moments of which lie 


pE 
between 0 and @ + d@ will be equal to e"T gi sin 0 do. 
For ordinary temperatures, pE < kT. Even for extremely strong 


fields of the order of 10° v/cm, the ratio te will be of the order of 


0.01 (the order of magnitude of dipole moments is 10-18 Gaussian 
unit). Therefore, we can use the approximation e* ~ 1+ 2, and 
the fraction of molecules sought will be equal to 


const (1 +2 cos 6) sin 0 dé. 


The integral of this expression from 0 to x should equal unity from 
the probability viewpoint, since for any molecule the direction of p 
lies somewhere between 0 and x. Then, ascan be easily verified, the 
constant is equal to Sand the fraction of molecules the polarisation 
vectors of which lie in the interval 8 to 6 + dO will be equal lo 


+ (1-4-5 cos 6) sino a0. 


The projection of the dipole moment on the direction line of the 
field is p cos 8. If N is the number of molecules per unit volume, 
the fraction contributed to the polarisation vector by molecules 
inclined at an angle @ to the field will be equal to 


5 Np (1-+-25 cos 0) sin 0 éos 6 d9. 


The polarisation vector P can be determined by integrating this 
expression from 0 to nm. We obtain 


p? ax. 
P=Naz EF, 
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hence, the orientation polarisabilily is given by the formula 


2 
b= ET: 


The relationship between molecular polarisation and temperature 
is expressed by the formula 


9 


4x p? 
f= i bee sa 
P= Nav (a+ fe) - 


This theoretical conclusion is in excellent agreement with experi- 
mental results. By measuring & as a function of 7, we can easily 
determine the two parameters which describe the electrical proper- 
ties ofa molecule, viz., polarisabilily and the “rigid” dipole moment p. 

Thus, the values obtained for a by measuring R can be used to 
determine p by substituting in the expression for &. 

Ixperiments indicate that in certain cases the interaction of di- 
poles of neighbouring particles may result in significant changes in 
permittivity as compared with the value of € for a system of nonin- 
teracting molecules. This can be shown by measuring e in a liquid 
and in a gas formed of the same molecules. 

The interaction of particles also affects the permittivity of crystals. 

As arule, electric polarisation in crystals occurs only as the result 
of the deformation of electronic clouds and the displacement of ions. 
No orientation polarisation occurs since, by and large, molecules 
cannot turn in a crystal. 

In many ionic crystals, the index of refraction squared is consider- 
ably less than the permittivity. (For example, the values for rock 
salt are 2.37 and 6.3, respectively, for titanium dioxide 7.3 and 114, 
and for lead carbonate 4.34 and 24.) In such crystals, the electronic 
cloud is deformed and, in addition, the ions are displaced as a whole 
under the aclion of a static field. On the other hand, it has been estab- 
lished that in molecular crystals the permittivity is equal to the 
square of the index of refraction. This indicates that polarisation is 
due exclusively to the deformation of the electronic cloud. 

Since orientation polarisation is absent, permittivity varies very 
little as the temperature changes. | 

It has already been indicated that in the case of a rapidly varying 
field there is no orientation polarisation and the molecular polarisa- 
tion becomes equal to the refraction. It is important to know which 
field oscillations should be considered rapid. This can be determined 
if the relaxation time is known. When the relaxation time t is 
much greater than the oscillation period, there is no orientation 
polarisation. ; 

The relaxation time t was discussed on p. 164. If a dielectric is in 
a constant field, its dipoles assume an equilibrium orientation dis- 
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tribution which depends on the temperature. When the field is 
switched off, the dipoles become disoriented. However, this does not 
occur instantaneously, i.e., the order decreases in accordance with an 
exponential law. The rate of this decrease is described by the relaxa- 


tion time t—Lhe time in which the polarisation decreases to — of ils 
a 


original value. If t is much greater than the oscillation period, the 
direction of the external field changes before the dipoles change their 
orientation. A very rapidly varying field does not affect the behaviour 
of the dipoles at all. If t < T, at each instant the state will be in 
equilibrium and the polarisation will closely follow changes in the 
field. For most dielectrics, the relaxation time is of the order of 
10-}2-10-#8 sec. - 

Examples. 1. Using the results of the example on p.656 let us deter- 


mine the values of the inherent polarisability @ and the orientation po- 
larisability b for benzene and water. 


= ; : P 
a=3R/4AN ay, whence apenzene =10-23 Gaussian unit; 


Qwater = 0.14X 10-23 Gaussian unit. 
On the other hand, 
a+b=3//40N ay, (@+b)oenzene = 10-23 Gaussian unil; 
(@+ b) water = 0.7X 10-23 Gaussian unit. 
It follows that 
Dvenzene =9 and bywater=0.7X 10-23 —0.14%< 10-23 = 
=0.5610-23 Gaussian unit. 


This means that a benzene molecule does hot have a rigid dipole moment, 
but a water molecule does. 


2. Let us determine the rigid dipole moment of a water molccule from the 


formula p= //3kTb. If the molecular polarisation # ‘and the molecular 
refraction R are measured at room temperature (T = 300°K), 


p= V3 1.38 x 10-1 x 300 X 0.5 X 10-23 = 0.8 x 10-48 Gaussian unit. 


This value is in close agreement with experimental results. 

Frequently, the unit 1 debye = 107-18 Gaussian unit is used as a measure 
of dipole moment. This unit is named after the German scientist Debye, who 
developed the theory of dipole moments. 


261. Additivity of Molecular Refraction 


The refraction R is a molecular constant. R does not depend on 
the density or phase of a substance (this has been demonstrated exper- 
imentally), nor on the temperature of the given substance. A conven- 
ient property of refraction is its additivity. If it is possible to com- 
pile a table of property increments* for all possible atoms, and if 
the magnitude of the property is determined as the sum of the incre- 
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ments, such a properly is said to be additive. The additivity of R 
can be used for analytical and identification purposes. (It should be 
noted that this additivity possesses no theoretical basis and in a num- 
ber of cases there occur significant deviations from the ideal.) 

Innumerable observations have been processed by numerous inves- 
ligators, and tables of R increments have been compiled. (Most of 
these tables are for Rp increments, i.e., measurements of the index 
of refraction for the so-called D-line, the yellow line of sodium.) 
For example, for C, H and Cl atoms, the increments are equal to 
2.418, 1.100 and 5.967, respectively. By means of these values alone, 
one can predict the molar refraction of many compounds: 


methane, CH,: R=2.418 -+-4> 1.100; 
chloroform, CHCl3: R= 2.418 +- 1.100+ 35.967; 


carbon tetrachloride, CCl,; R=2.418-+-45.967, 


etc. Refractions can be measured with great accuracy and when neces- 
sary extremely small differences can be determined. 

In view of dispersion anomalies, which, as was explained earlier, 
occur at frequencies close to the 
natural frequencies of absorption, 
refraction should be measured in 
a region far removed from the 
absorption bands. 

Indexes of refraction are meas- 
ured by means of refractometers. 
Most refractometers measure the 
angle of refraction of a beam of 
light (emerging from a material 
under investigation) which impinges 
on the surface of a prism made of 
glass with a higher n. 

If a bundle of rays with an angle 
of incidence of 0° to 90° reaches 
the boundary between the mate- 
rial under investigation and the 
glass, the refracted rays will lie 
between 0° and a certain critical 
angle @ the sine of which will be 
equal to the ratio of the index of 
refraction of the material under in- 
vestigation to that of the glass of 
the prism (see Fig. 286). The criti- 
cal angle is indicated by a sharp 
line in the focal plane of the tube. Fig. 286 


* These are the contributions of a given atom to the value of a given phy- 
sical quantity. 
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To measure the index of refraction of a liquid, we must place 
a 0.5-mm layer of it on the surface of the prism. When measuring 
solids, the material must make close contact with the surface of the 
prism. Optical contact is achieved by using drops of an appropriate 
liquid between the surface of the prism and that of the material 
under investigation. The index of refraction of a powder may be 
determined by immersing the powder in a liquid whose index of 
refraction is the same as that of the powder. 


262. Pyroelectric and Piezoelectric Materials 


A crystal which does not have a centre of inversion included in 
its symmetry elements may possess a number of interesting proper- 
ties. Such crystals may have an electric moment (polarisation vector) 
in the absence of an external field. 

First, let us direct our attention to crystals which become polar- 
ised with homogeneous deformation*. This property is characteristic 
of piezoelectric crystals, which were discussed in Sec. 45. 

The occurrence of polarisation on com- 


2 pression, extension, etc., shows that ho- 
t mogeneous deformation results in the 
ae creation ofa special, i.e., single (not mul- 


tiplied by the number of symmetry ele- 

ments) direction. Such behaviour is nol 

possible when a crystal has a centre of 

Le inversion. Homogeneous deformation can- 

yam) not eliminate a crystal’s centre of inver- 

sion. At the same time, Lhe existence of a cen- 

tre of inversion is incompatible with the 

existence of a special direction such as the 

polarisation vector direction. Any crystal 

that does not have a centre of symmetry 

may possess piezoelectric properties. Nev- 

ertheless such properties are not found in 

many crystals of this type. This may be 

Fig. 287 due to the fact that instruments are not 

sufficiently sensitive. However, we may 

conceive of a noncentral symmetric structure in which a homoge- 

neous deformation does not displace the centre of gravity of 

positive charge relative to the centre of gravity of negative charge. 

Close examination shows that the piezoelectric effect is not possible 
in one of the noncentral symmetric groups of symmetry. 


* This is a deformation in which all volume clements are deformed in the 
same manner. 
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The most common piezoelectric material is quartz. Fig. 287 
shows how piezoelectric plates may be cut from a quartz crystal. 
The nature of atom displacements can be assessed from Fig. 288. 
The structure of quarlz may be pictured as a compact packing of 
oxygen ions in the empty spaces of which silicon atoms are located. 
(In the figure, silicon aloms are represented as black spheres and oxy- 
gen atoms as while spheres.) The oxygen ions carry a negative charge 
and the silicon ions a positive charge. A silicon atom is surrounded 
by four oxygen aloms. Electrisation of quartz occurs when it is com- 
pressed along the polar axes. Pressure applied along axes lying in 
the plane of the figure results in the dis- 
placement of ils positive charge relative 
to ils negative charge. Pressure along an 
axis of the third order (a nonpolar direction 
perpendicular to the plane of the figure) is 
ineffective. 

Atom displacements cannot be shown in 
the figure. These displacements are quite 
negligible and cannot be detected by objec- 
Live methods (e.g., X-ray structural analysis). 
The piezoelectric constant of quartz, i.e., Fig. 288 
the magnitude of the polarisation vector 
for unit pressure, is equal to 6.5X 10-® Gaussian unit. The volume of 
a unit cell of quartz is equal to 112 x 10-*4 cm®. Since there are three 
SiO. molecules in a cell, the number of molecules in a unit volume is 
equal to 2.7 x 10” and, therefore, the dipole moment per molecule 
for unit pressure is equal to 2.4 x 10-*° Gaussian unit. The charge 
of a molecule is equal to 14 + 2 xX 8 = 30 electron units. Therefore, 
the displacement of the centre of gravity of positive charge relative 
to the centre of gravity of negative charge is a negligible quantity 
of the order of 10-!§ cm. 

Piezoelectric crystals include a class of materials known as pyro- 
electric crystals. Such crystals are naturally polarised under normal 
temperature and pressure. Usually this effect is masked by the free 
surface charge which accumulates along the boundaries of the crys- 
tal, but it may be detected when the temperature of the crystal is 
raised. Hence the designation pyroelectric (pyro means fire). 

Pyroelectric crystals have even more restricted symmetry. Only 
a crystal having a special axis can be termed pyroelectric. Thus, the 
mere absence of a centre of symmetry is insufficient. The significance 
of this condition is evident. The presence of natural polarisation 
indicates that such a special direction is present in pyroelectric crys- 
tals, while in the case of piezoelectric crystals such a direction 
appears only under the action of mechanical deformation. One of 
the most common pyroelectric substances is tourmaline. 
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A pyroelectric crystal has a very strong internal electric field. 
Therefore, the superposition of an external field does not change the 
polarisation of such a crystal, i.e., the polarisation cannot be in- 
creased, decreased or rotated. Such acrystal is polarised to saturation— 
all particle dipole moments are parallel. In the theory of ferromagnet- 
ism (see Sec. 266, which may be read with profit at this point). 
a region in which the magnetic moments of atoms are parallel is 
called a domain. The same term is applied to a region in which the 

electric dipole moments of all particles 
are parallel. A pyroelectric crystal usu- 

ZON SU ally constitutes a single domain, but there 
are exceptions. A number of substances 
may yield multidomain crystals. Such 
pyroelectric substances are known as 
Seignette electric, or ferroelectric, sub- 

stances. The former designation is de- 
en Oye rived from Seignette salt, in which B. V. 

Kurchatov and P. P. Kobeko first detect- 
esa O00 On ed this effect. The latter designation 
emphasises the close similarity between 
these substances and ferromagnetic oncs. 
Both ferromagnetic and ferroelectric sub- 
stances have very large dielectric constants (values of several - 
hundred or even thousand), pronounced hysteresis effects, and 
Curie points. The discussion of Sec. 266 is completely applicable 
here and will not be repeated. The points made about the effect of 
a field, polarisation by the displacement of domain boundaries, and 


the reasons for the division of a crystal into small domains all apply 
to ferroelectric materials as well. 


A large number of ionic pyroelectric crystals are ferroelectric. 
Several of these are particularly suitable to demonstrate ferroelectric 
properties. Barium titanate, BaTiO3, is typical in this respect. 
At 120°C, barium titanate loses its special properties and becomes an 
ordinary dielectric. At temperatures above 120°C, this substance has 
the simple unit cell shown in Fig. 289. The cell is cubic; at the centre 
there is a titanium atom, at the cornersof the cube barium atoms, and 
at the face centres oxygen atoms. The cell has a central symmetric 
structure; above 120’, the crystals no longer exhibit pyroelectric prop- 
erties. When the temperature is reduced a phase transition occurs 
and the structure changes: one of the cube edges becomes 1 per cent 
longer than the other two and the cube is transformed into a tetra- 
hedron. In this process, the titanium atom is displaced in the direc- 
tion of one of the oxygen atoms. This now becomes the special direc- 
tion and the polarisation vector will be parallel to this line. A bari- 
um titanate crystal has three directions of weak polarisation, rather 


Fig. 289 
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than one, since the displacements along the three axes of the cube 
are equal, 

When the subslance is cooled below the Curie point (120°C for 
a barium titanate crystal) different regions of the crystal may be 
transformed into domains with different orientations. A crystal which 
has acquired a domain structure is in a state of mechanical stress, 
i.c., some portions of the crystal are compressed and others extended. 
Strictly speaking, a'domain crystal is not a monocrystal—three-di- 
mensional long-range order throughout the crystal] is no longer present. 

On further reduction of temperature, barium titanate undergoes 
yel. another phase transformation at about -+ 10°, but it does not 
cease to be ferroelectric. 

Seignetle salt behaves differently. It possesses ferroelectric proper- 
ties only within a narrower temperature inlerval—from —20°C 
to -+ 24°C. ” 


CHAPTER XXXVI 


MAGNETIC SUBSTANCES 


263. Three Groups of Magnetic Substances 


Substances may be classed into three groups in accordance wilh 
their magnetic properties: diamagnelic, paramagnetic and ferromag- 
netic. The values of diamagnetic susceptibility lie in the range of 
—13 x 10-° (bismuth) to —0.8 x 10-% (copper). Paramagnetic bod- 
ies are characterised by positive susceptibility—for example, 0.4 x 
% 10-% (potassium) and 320 x 10-® (iron chloride). Ferromagnetic 
bodies are characterised by large values of permeability. These are 
hundreds and even thousands of times greater than those of other 
bodies. Let us examine the structural features which explain these 
great differences in magnetic properties for substances which other- 
wise do not show great differences in properties. 

Diamagnetism, it will soon be seen, is a universal property of all 
bodies inasmuch as they consist of electrons. The above values show 
that diamagnetic properties are weaker than paramagnetic ones and, 
a fortiori, weaker than ferromagnetic properties. Diamagnetic proper- 
ties may be detected only in the absence of properties resulting in 
positive magnetism. Paramagnetic and ferromagnetic bodies have 
diamagnetic properties, but they are obscured by the stronger positive 
paramagnetism. Thus, diamagnetism exists for any system containing 
electrons. On the other hand, positive magnetism arises only in 
bodies the atoms of which possess a magnetic moment. The phenom- 
enon of paramagnetism is very similar to.the process of electrisa- 
tion of a dielectric, which consists of rigid dipoles possessing a con- 
stant dipole moment. 

The presence of a magnetic moment in atoms is also a necessary 
condition for the existence of ferromagnetic properties. However, the 
peculiarities of ferromagnetic substances are due to a very specific 
property, viz., the formation within a body of vast regions—do- 
mains—within which the magnetic moments of thousands of millions 
of atoms are arranged parallel to one another. 


264. Diamagnetism 


Diamagnetism is a direct consequence of the tendency for an elec- 
tron to move in a circle in a magnetic field. 
In a magnetic field with an induction B, an unbound charged par- 


i é : F eB 
ticle moves in a circle with an angular frequency Oe Tt can 
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be rigorously proved that the action of a magnetic field on an elec- 
tron moving in a central field—in particular, in the field of an atomic 
nucleus—produces an analogous effect: the electron will move in 
eB. 
2me 

This motion is superimposed on other motions which may be per- 
formed by the electron: the chaotic motion of particles of the electron 
gas or the motion of the electron about an atomic nucleus. 

The fundamental considerations discussed on p. 494 showed that 
such molion may be equated to a circular electric current. When the 
magnetic field is switched on, the electrons begin to rotate about the 
magnetic field and each produces an M 
elementary current , 


a circle aboul a line of force, but at one-half the frequency, viz., 


Multiplying this value by the area of the = 
circle described by an electron in its 7 ~ 
motion about a line of force, we obtain VU 
the value of the diamagnetic moment [ 
created by one electron: 

M=—+<°s. —-* SB. 


2a 4nme* 


The reason for the minus sign is clear 
from Fig. 290: the direction of the moment. B 
is opposite to that of the field. Fig. 290 
When a system consists of a large num- 
ber of electrons, we must take the summation of the above 
expression with respect to all the electrons: 


e2 
M=— 5 >) SB. 


Since by definition (see p. 290) magnetic susceptibility is equal to 
the ratio of magnetic moment per unit volume (or unit mass or mole) 


to induction, 
Nee wy 
~~ ~~ Anme? » Si. 


If N is Avogadro’s number, x represents molar diamagnetic suscep- 
tibility (in comparing with the resulls on p. 290, note that x ==) : 


Thus, x is given by the areas circumscribed by electrons in their 
secondary motion in the magnetic field. In principle, this computa- 
tion can be made if we know the wave function of the system, i.e., in 
the final analysis, the electron density. Actually, since the computa- 
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tion is very cumbersome, the diamagnetic susceptibility is deler- 
mined experimentally. 

It should be emphasised that diamagnetic susceptibility is deter- 
mined by the electron structure of the system and does not depend 
(at least for atoms and molecules) on external conditions, including 
temperature. 

Diamagnetic susceptibility, like molecular refraction, possesses 
additivity. If the diamagnetic susceptibility is taken for a mole of 
substance, the susceptibility 7% of a molecule may be expressed wilh 
considerable accuracy as 


%= DNAaXar 


where n, is the number of atoms of type A in the molecule and y, 
is the increment for the given atom. For purposes of illustration, we 
can use the same example as for refraction (see p. 661). C, H and Cl 
atoms have the increments 7.4, 2.0 and 18.5 (y4 x 10°), respective- 
ly. Thus, we obtain 15.4 for methane, 64.9 for chloroform, and 81.4 
for carbon tetrachloride. These values are in close agreement with 
experimental results. 

_ The significance of this additivity consists probably in the follow- 
ing: outer electrons weakly affect diamagnetic susceptibility. In 
so far-as additivity is realised, diamagnetic susceptibility is an 
atomic rather than a molecular property. 

_ Diamagnetic susceptibility, as indicated in the preceding article, 
1s a property associated with substances the atoms and molecules 
of which do not have a constant magnetic moment. Such particles 
include in the first place atoms and ions with completed shells— 
the ions F~, Cl- and Nat and atoms of the noble gases. Atoms and 
ions which in addition to a completed shell contain two more s-clec- 
trons with anti-parallel spins, e.g., Zn, Be, Ca and Pb*+, are also 
diamagnetic. 

The group of diamagnetic molecules is incomparably larger than 
the group of paramagnetic molecules. The latter exists more in the 
nature of exceptions. This is due to the fact that practically all mole- 
cules have valent bonds formed by a pair of electrons with anti-par- 
allel spins. Usually, the total moment about a nucleus, as well 
as the spin moment, equals zero in such molecules. Thus, bodies 
consisting of atoms and ions such as those cited above and practical- 
ly all bodies the building blocks of which are molecules—therefore, 
practically all organic substances—are diamagnetic. 

Diamagnetic susceptibility describes the electron cloud of a mole- 
cule. If the distribution of electrons in a molecule is strongly aniso- 
tropic, its magnetic susceptibility is also anisotropic. The anisotropy 
of diamagnetic susceptibility is manifested particularly in molecules 
of the aromatic compounds. For example, in benzene, x, the molar 
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diamagnetic susceptibility in a direction lying in the plane of a ben- 
zene ring, equals —37 x 10-%cm*/mole and x, the molar diamagne- 
tic susceptibility in a direction perpendicular to the plane of a ring, 
equals —91 x 10-& cm*/mole; in naphthalene y , = —40 x 
-, 10-%em*/mole and x; = —190 x 10-®cm*/mole. Anisotropy may 
be detected by measuring crystals oriented in different directions in 
the field. Measurements of powders, liquids and gases yield a value 
of magnetic susceptibility for an averaged orientation. 


265. Paramagnetism 


A substance has paramagnetic properties if the atoms, ions or mole- 
cules of which it consists possess a magnetic moment. A magnetic 
moment is due either to the uncompensated spins of electrons in the 
alomic system or to the motion of electrons about nuclei, or both. 

As was explained earlier (see p. 498), a magnetic moment resulting 
from spin is related to angular momentum as follows: 


Ws = ty Vs(s+4), 


and a magnetic moment resulting from the motion of electrons about 
a nucleus is related to angular momentum as follows: 


bp =e VL (L+ 14). 


Here, wy is the Bohr magneton and s and L are, respectively, the 
total spin momentum and the total angular momentum for motion 
about a nucleus, taken for an atom or molecule as a whole. As pre- 


% é : h : 
viously, s and Z are expressed in units of i When paramagnetism 


is due to both effects, the formula for the magnetic moment of an 
atom or molecule takes the form 


= 8a VI (J +4), 


where J is the quantum number of the total quantum momentum, 
i.e., the vector sum of Zand s, and g is the Lande factor, which depends 
on all three quantum numbers. Incidentally, the proximity of g to 
1 ori 2 (established experimentally) is an excellent indicator of the 
origan of the magnetism of a given substance. 

Pramagnetic atoms and ions include particles having one electron 
over and above a completed shell (e.g., atoms of the alkaline metals), 
atoms of the transition elements, ions of the rare earth elements 
with incomplete shells, etc. 

Most molecules, as already indicated, are diamagnetic. Molecules 
of oxygen and sulphur, which are paramagnetic, are exceptions and 
have a total spin equal to 1. The magnetic moment obtained experi- 
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mentally is in close agreement with the value calculated by means 
of the formula 


p=: Quy V2. 


The presence of paramagnetism is proof of the fact that the mole- 
cules contain unpaired electrons. This circumstance makes the meas- 
urement of the magnetic properties of molecules of greal interest 
to the chemist. The so-called free radicals, which are chemical com- 
pounds with an unpaired electron, possess paramagnetic properties. 
Free radicals are created in a number of instances in chemical reac- 
tions, and the measurement of magnetic susceptibility is a possible 
method of studying the course of chemical reactions. 

How is the value of the paramagnetic moment of a molecule relat- 
ed to that of magnetic susceptibility? In paramagnetic bodies located 
outside a magnetic field, the magnetic moments are distributed ran- 
domly with respect to direction, and the total magnelic moment of 
a substance is equal to zero. When a field is switched on, the atoms 
(or molecules) will tend to rotate in such a way that their magnetic 
moment coincides with the direction of the field. As a resull, equi- 
librium is established between two tendencies: the ordering action 
of the field and the tendency to thermal randomness. The reasoning 
used on p. 656 to derive the value of the polarisability of a substance 
consisting of rigid electric dipoles is completely applicable here. 
Therefore, like in that case, the relationship between the magnetic 
moment of an atom (or molecule) and the paramagnetic susceptibilily 
of an atom is given by the expression 


2 

Xatom = oF , 
In contradistinction to diamagnetic susceptibility, the paramagnel- 
ism of a substance depends on temperature. To be sure, the situation 
here isssomewhat more complex than in the case of dielectrics. This 
is due to the fact that the electric moment of a molecule is a constant, 
while the magnetic moment of a molecule (or atom) may vary consid- 
erably with the temperature. Paramagnelic moment is related to 
quantum numbers, and the distribution of molecules according to 
state may depend greatly on temperature. Therefore, the simple law 
that magnetic susceptibility is inversely proportional to temperature 
(the Curie Law) may not be valid in the case of paramagnetic sub- 
stances. 


266. Ferromagnetism 


Domain. A small number of substances possess marked (using coarse 
observation methods) magnetic properties. These substances 
include iron, cobalt, nickel, gadolinium, compounds of these ele- 
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ments, and certain compounds of manganese and chromium. Since 
iron is the most important of these, such substances are said to be 
ferromagnetic. ; 

Atoms of a ferromagnetic substance have a magnetic moment, 
which, moreover, is caused by spin (at least, basically). However, 
it is not this feature that distinguishes il from a paramagnetic sub- 
stance. The main characteristic of a ferromagnetic substance is ils 
domain structure. Adomain is a 
region which is magnetised to 
saturation, i.e., a region in which 
all atoms are arranged with their 
magnetic moments parallel. Since 
the linear dimensions of do- 
mains are usually of the order of 
0.01 mm, they may be observed 
by means of an ordinary micro- 
scope. 

Domains exist in a ferromag- 
nelic substance in the presence 
as well as in the absence of a 
field. In order to observe do- rj 

: ‘ Fig. 291 

mains, we placeadrop of colloidal 

suspension—a finely divided sub- 

stance such as magnetite (Fe;0,)—on the polished surface of a ferro- 
magnetic monocrystal. Colloidal particles become concentrated close 
to the boundaries of the domains since strong local magnetic fields. 
exist along such boundaries (as in the case of any bar magnet) and 
they altract the grains of magnetite (see Fig. 291). 

First, let us consider certain problems arising in connection with 
one domain; then we shall study the arrangement of domains in crys- 
tals; and finally we shall examine the process of magnetisation of 
a ferromagnetic substance. St 

The orientations of the magnetic moments of atoms forming a sin- 
gle domain are not arbitrary. Every crystal of a ferromagnetic substance 
has a particular crystallographic direction along which it is most 
easily magnetised. In hexagonal cobalt this is a single direction—the~ 
hexagonal axis. In cubic iron this direction is the edge of a cube. 
This means that there are three directions of easiest magnetisation 
and accordingly three directions of magnetic moments of domains. 
In cubic nickel the spatial diagonals of a cube are axes of easiest 
magnetisation, i.e., there are four possible directions of magnetic 
moment. 

Why is it that atoms in a ferromagnetic substance arrange them- 
selves so that their magnetic moments are parallel? This is caused by 
a specific phenomenon—the interchange of positions by electrons. 
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As indicated in connection with a chemical bond, the overlapping of 
wave functions results in a decrease in energy. Ileclrons occupy 
a common space and are able to interchange positions. The tendency 
of exchange energy to become minimal is the reason for the stabilily 
of most chemical compounds. [xchange energy plays an analogous 
role in the creation of a domain. In the case of a chemical bond, the 
minimum value of exchange energy is achieved when the spins of 
interchanging electrons are anti-parallel. However, the general con- 
clusion of quantum mechanics is broader, i.c., the exchange energy 
may in certain cases be minimal for parallel orientation of spin and 
in other cases for anti-parallel orientation of spin. In ferromagnetic 
substances, the spins of atoms contained in a domain have a parallel 
orientation. Comparatively recently, a new class of compounds— 
anti-ferromagnetic substances—was discovered. In these substances, 
stable domain states occur for anti-parallel orientation of spin. 
From measured values of the magnetisation of a domain, one can 
calculate the number of spins per atom involved in ferromagnetism. 
Such numbers are not whole numbers (for iron 2.2, for cobalt 1.7, 
for gadolinium 7.1, etc.). It must be concluded that to a certain 
extent, the electrons forming an electron gas are also involved in the 
creation of ferromagnetism. However, in the main, electrons bound 
lo atoms are responsible for ferromagnetism. In iron, conduction 
electrons come from the outer 4s shell, while ferromagnetic electrons 
are in the 3d shell. i 
_The existence of remarkable materials known as ferrites constitutes 
direct proof of the absence of any conneclion between conduction 
properties and ferromagnetism. These materials are semiconductors 
with a specific resistance of 10 to 11 orders of magnitude greater than 
iron. Conduction electrons, of course, play no role in the magnetism 
of these substances. Ferrites are mixed compounds; for example, 
manganese ferrite is a 1:14 mixture of manganese oxide and iron 
oxide, and nickel ferrite is an analogous mixture of nickel oxide and 
iron oxide. Iron oxide contains two iron aloms, and nickel oxide one 
nickel atom. A crystal of the mixture represents a compact packing 
of oxygen atoms. The nickel atoms and the two iron atoms fit into 
the empty spaces. It was indicated on p. 600 that there are two kinds 
of empty spaces in a compact packing arrangement, viz., tetrahe- 
dral and octahedral. An atom which fils into an empty space of the 
first kind is surrounded by four neighbours, while an atom in an 
octahedral space has six neighbours. The iron atoms fit into both 
kinds of spaces. The magnetic moments of the iron atoms are quite 
ordered, but the moments of iron atoms in tetrahedral spaces point 
in one direction while the moments of iron atoms in octahedral spaces 
point oppositely. As a result, the actions of these two systems 
of moments cancel each other and the magnetic properties of such 
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a mixed oxide result from the magnetism of nickel, the moments of 
whose atoms are all pointed in one direction. 

The presence of exchange energy explains the tendency of atoms 
lo arrange themselves so thal their spins are parallel or anti-parallel. 
Apparently, in ferromagnetic 
substances, the exchange en- 
ergy of interaction becomes 
of prime importance and 
causes (the substance to havea 
spin arrangement such that 
the energy assumes a mini- 
mum value. In the remaining 
paramagnetic substances, oth- 
er components of interaction 
energy do not allow the ex- 
change energy lo make itself 
fell. 

The long-range order of 
atoms is destroyed at a cer- 
tain temperature: the crys- 
tal becomes fused. Temper- 
ature alfects the arrange- 
mentofthemagnelic moments 
in exactly the same manner. 
Fig. 292 shows schematically 
how the magnelic moments of the atoms behave’ when 
the temperature is raised. At first the vibrations are in phase, 
then disorder begins to prevail, and finally the magnetic order 
“mells away”. Beginning at a definite temperature called the Curie 
point, in honour of the oulstanding French scientist Pierre Curie, 
the order in the arrangement of arrows disappears and the substance 
loses ils magnetic properties, i.e., the ferromagnetic substance turns 
into a paramagnetic substance. For iron the Curie point lies at 770°C, 
for cobalt at 1,415°C, for nickel at 358°C and for gadolinium at 15°C. 

In an anti-ferromagnelic substance, the spin of atoms tends to 
assume an orderly, but anti-parallel, arrangement. The structure of 
a domain of manganese oxide, which is an anti-ferromagnetic sub- 
stance, is shown in Fig. 293. Arrows represent the moment of manga- 
nese. From the figure, we see that the chemical period of structural 
repetition is one-half of the magnetic period. At absolute zero each 
atomic magnet of the anti-ferromagnetic substance is surrounded 
by atoms with oppositely direcled moments. As in the case of a fer- 
romagnelic substance, this order is destroyed at a definite Curie 
temperature and above this critical point it behaves like a para- 
magnetic substance. 


c) 


Fig. 292 
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The existence of various anomalies in the behaviour of a body in 
passing through the Curie point is indirect evidence of the existence 
of anti-ferromagnetic properties. Since the Curie point is a point of 
phase transition of the second kind} a number of properties undergo 
an abrupt change in passing through it. 

Direct evidence of the existence of anti-ferromagnetic properties 
has been obtained by means of neutron diffraction methods. The 
scattering of neutrons by a lattice (see Fig. 293) is sensitive to the 
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Fig. 293 


chemical period, rather than to the magnetic period, of structural 
repetition. 

Domain Structure of a Crystal. In examining the domain structure 
of a ferromagnetic monocrystal by the powder method, which was 
described earlier, we note that a domain is never very large, i.e., its 
linear dimensions are usually no greater than 0.01 mm. It is found, 
moreover, that cubic ferromagnetic substances have extraordinarily 
symmetric combinations of differently oriented domains. These 
two circumstances require explanation since, it would seem, that 
thanks to the ease of magnetisation an entire crystal should be trans- 
formed into a single domain , 

L. D. Landau and E. M. Lifshits have shown that a domain struc- 
ture of the kind shown in Fig. 291 is a natural consequence of the 
existence of different energy forms in a ferromagnetic body. The 
essence of the theory is illustrated in Fig. 294. The first diagram 
corresponds to a single domain, the magnetic energy of which is 


x \ H?dt. But the energy corresponding to the second configuration 
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is only one-half of this value. In the case of V parallel domains, the 
energy will be about + of that for a single domain. However, this 
dividing process will be advantageous only up to a certain point. 
Beyond that point, the energy of boundary layers exceeds the decrease 
in energy associated 

with the division of a 

crystal into domains. The NN NN es 
advantage of configura- 


Fi . < | 
tions which consist of | 
| 
| 


domains forming closed { { | 
circuits is evident. In 
such cases, a closed mag- | 
netic flux circuit is | 
formed and the energy of SS §8S SS VN 
the field outside the crys- 7 


tal equals zero. 

In the case of cobalt, 
which has a magnetisa- 
tion direction along its 
axis, we encounter do- 
mains in which the mo- 
ments are oriented only 
along the axis ofa hex- 
agon. The zero magnet- 
ic moment of a body 
in the absence of an 
external field is realised Fig. 294 
as follows: half of the 
domains have one orientation and the other half the op- 
posite orientation. 

A few words regarding the boundary between domains. This bound- 
ary layer is shown schematically in Fig. 295. We see that in this 
layer the magnetic moments gradually change direction. The thick- 
ness of the layer is determined by the requirement for minimum 
energy. Two opposite tendencies occur here. On the one hand, it is 
desirable to extend over a thick layer—this being of greater advantage 
with respect to exchange energy—the disadvantageous process 
of spin turning. On the other hand, itis better to complete this proc- 
ess rapidly since in the transition layer the spins are at an angle 
to the directions of easiest magnetisation. 

Now, let us consider what happens in a ferromagnetic substance 
when an external field is switched on. The magnetisation process 
may be followed by the powder technique. It transpires that the 
basic mechanism of magnetisation consists in the growth of a domain, 
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which points in the “required” direction, by means of boundary dis- 
placement. The.domains which form an acute angle with the field 
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Fig. 295 


“swallow up” those which form an obtuse angle with the ficld. At 
the beginning of magnetisation, domain boundary displacement is 


Fig. 296 


reversible, i.e., when the field -:is switched off 
the initial boundaries of the domains are re- 
stored. Later on, domain boundary displacement 
becomes irreversible. Finally, when a very high 
degree of magnetisation is reached, the direc- 
tion of magnetisation of the domains be- 
gins lo turn. This is illustrated in Fig. 296, 
In polycrystalline substances, the situation 
is exactly the same (assuming the crystal par- 
ticles are nol extremely small since domains 
are not formed when the size of the particles 
are less than 10~° cm), i.e., each grain may 
consist of several domains. In so far as the 
crystallographic axes of the particles have 
random orientations in the body, however, 
the magnetic moments of the domains orient 
themselves randomly. Thus, the simple mag- 
netisation diagrams which have come down 
to us from the days of Ampere provide a cor- 
rect picture of polycrystalline substances. 


CHAPTER XXXVII 
EFFECT OF ELECTRON STRUCTURE ON PROPERTIES 
OF BODIES 


267. Free Electrons 


Until now, in discussing the structure of solids and liquids, we did 
nol pay particular allention to the role of electrons in the formation 
of the properties and structure of these bodies. We were able to do 
this because the electron structure of bodies is by no means always 
of prime importance. [In a number of cases, however, the role of elec- 
trons must be taken inlo account. There are two “kinds” of electrons 
ina body, viz., bound electrons and unbound (free) electrons. Bound 
electrons are component parts of a specific atom, ion or molecule. 
Unbound electrons belong to the entire crystal or liquid and may 
move quile freely between aloms. 

In molecular substances, the picture of electron structure is 
particularly clear. In most cases, there are no common electrons, 
i.e., none of the electrons leaves the “bounds” of a molecule. In 
ionic crystals, the restriction of electrons is not quite so clear. Even 
according to Lhe classical view of an ionic bond, it cannot be assumed 
that electron exchange is completely absent. Nevertheless, elec- 
Lrons passing from ion to ion (exchange electrons) in ionic crystals do 
not behave like free electrons; their displacement in such a crystal 
consists in the transfer of an electron from one atom to its neigh- 
bour. This is quile clear in crystals having a homopolar bond. Diamond 
is an insulator, although the electrons binding the carbon atoms 
are by no means restricted lo specific positions, bul are relayed from 
alom lo atom. 

Metals differ quile considerably from all of the bodies mentioned 
above. Here, we encounter electrons for which the term “free” is 
entirely justified. Electrons are displaced in a metal just like gas 
particles in a tube filled with obstructions. Atomic products (ions) 
in a slate of thermal vibration act as obstructions. The presence of 
free electrons is revealed primarily in conduction phenomena as well 
as all experiments involving the escape of electrons from a body. 
This type of phenomenon could not be explained without considering 
the peculiar behaviour of common electrons. 

It would be incorrect, of course, to assume that the division of 
electrons into bound and free electrons is absolute. This may rather 
be considered an idealised division. In solids we may encounter elec- 
trons which are bound in various degrees. This became particularly 
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evident when physics assigned a rightful place to semiconductors, 
which occupy an intermediate position between a system of ideally 
free electrons and a system of exchange electrons, or a system of 
electrons bound to molecules. It is now known that any transi- 
tional type of structure is possible. 

It should be noted that an electron in a solid, just like an atomic 
electron, obeys the laws of wave mechanics. The representation of 
an electron as a spherule has validity within the limils imposed’ by 
the principle of uncertainty. Usually it is physically meaningless 
to speak of the path of an electron inside a metal. A description of 
the electron structure of a body consists primarily in indicating 
how its electrons are distributed according to energy. 

Theory shows that the electrons of a body may be represented as an 
electron gas, but this statement must be properly qualified. It tran- 
spired that it is possible to picture the electrons of a metal as a gas 
of fictitious particles, the effective mass of such a particle being 
dependent on its direction of motion. This point is made in order to 
caution the reader against making a superficial analogy between an 
electron gas and a gas consisting of molecules. 


268. Energy Levels in a Solid 


The energy levels of a free atom were discussed earlier. Such 
energy levels may be determined experimentally, i.c., by observing 
the energy transitions which occur when light is emitted or absorbed. 
When an atom possesses many electrons, a unique group of four 
quantum numbers is associated with each electron; according to the 
Pauli exclusion principle only one electron may exist in a given quan- 
tum state. Therefore, energy levels have a limited capacity. The s 
levels of an atom may contain two electrons, the p levels six, etc- 
This information may he determined experimentally and as a conse- 
quence of the fundamental laws of quantum mechanics. 

To determine the energy levels of a system consisting of a large 
number of atoms, we should use both approaches here as well. The 
fundamental theoretical concepts remain unchanged in the case of 
a system consisting of thousands of millions of atoms. Therefore, it 
may be concluded that the number of quantum states in a system 
consisting of 2 atoms will be n times the number ina free atom. Thus, 
the Pauli exclusion principle can he satisfied: only one electron 
will exist in a given quantum state. 

An atom never completely loses its identity in a given body. Spec- 
tral investigations indicate that significant changes affect only outer, 
valence electrons, which are responsible for interaction between 
atoms. Therefore, the quantum states of asolid must be closely relat- 
ed to the quantum states of an atom. Let us consider, for example, 
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the electrons of a A shell, which is closest to the nucleus of an atom. 
It is evident that the state of such an electron can change only very 
little when atoms are combined in a body. Nevertheless, the Pauli 
exclusion principle does not allow us to consider all A electrons as 
equal. It becomes necessary to assume that n extremely close A 
levels, each of which consists of a pair of electrons of opposite spin, 
exisl ina hody consisting of mz atoms. 

This reasoning also carries over to the other energy levels. It is 
assumed that the quantum states of a body are related to that of an 
alom by the following rule: 

a body consisting of n atoms 

has n times as many energy 3P 
levels as an individual atom. 38 
Each level of a free atom 

yields 7 close levels in a solid 

body. This means that the ener- 

gy levels of a body may be 2P 
viewed as a system of bands. 28 
Each band is a split level of 

an atom. Therefore, the same 
designations may be used for © 

the band as inatomic spectro- yg 
scopy: 1s, 2s, 2p, etc. The 
number of electronsin a band i/r 
will be, of course, m times the . 

number of electrons in the Fig. 297 

cor responding shell of an atom. 

Thus, in the 4s and 2s bands there will be 2 electrons, in the 2p 
band 6n electrons, etc. 

The width of a band depends on the interaction forces between 
atoms. This concept is illustrated schematically in Fig. 297. The 
energy levels of a sodium atom are shown at the left and the expan- 
sion of the levels into bands in the formation of a crystal lattice are 


shown at the right. The quantity — is plotted along the abscissa. 


Perceptible expansion of the 1s level diss not occur since the required 
interatomic spacings are absolutely unrealisable. The 2s and 2p 
bands are also practically unexpanded under normal conditions 
(indicated by a dotted line). On the other hand, the 3s and 3p bands 
are expanded to such an extent that they overlap. This means that 
the interaction which occurs between sodium atoms under normal 
conditions affects only outer electrons. (Sodium has no electrons in 
the 3p state. Nevertheless, we shall also be concerned with unoccu- 
pied energy levels when the exciting energy is sufficient to transfer 
an electron to such a level.) 
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What is the significance of the overlapping of the 3s and 3p bands? 
Actually, our scheme of correspondence between the energy levels of 
an atom and a solid fails in this case. However, we shall not let this 
disturb us. The overlapping of the bands signifies that the wave func- 
tion properties of an electron in the overlapping region differs from 
the wave function properties of an alomic electron. Thus, the outer 
electron of a free sodium atom is ans electron. In liquid and 
solid sodium the 3s and 3p bands overlap; the behaviour of the outer 
electrons of sodium differs from the behaviour of an s electron, i.e., 
certain special (hybrid) properties appear (such electrons reflect the 

peculiarities of s and p wave func- 
n(é) tions). 

The described behaviour may be 
established experimentally by means 
of spectral methods. The. presence of 

E— an energy band rather than a distinct 
energy level can be established by 
n(é) examining the transition of electrons 
from a high band to a low one. That 
which would have produced a sharp 
line in the case of a free alom now pro- 
oe duces a broad spectral band. 

It is more convenient to examine 
transitions from an energy band toa 
single distinct level—for example, in 
the case of sodium, transilions lo the 
2p level. The spectral band obtained 
in this manner enables us lo deter- 

€—~ mine the width of the energy band as 

Fig. 298 well as the electron distribution ac- 

cording to energy. In the case of sodium, 

this requires that eleclrons be dislodged from the 2p shell. The frequen- 

cies of the resulting transitions lie in the region of soft X-rays (sev- 

eral hundred Angstroms) and are very difficult to detect. Special 

X-ray tubes, in which the anode serves as the material under in- 
vestigation, are used in such studies. 

From measured values of the intensity of the obtained spectral 
band, we may plot a curve of intensity as a funclion of the frequency 


n(é) 


v. Butv = # (where @ is the transition energy, i.e., the energy rela- 


tive Lo the distinct level), and the intensily at a given v is proportion- 
al to the number of electrons having an energy &. Curves of x (8) 
as a function of &, where n (@) is the fraction of electrons in the 
band having an energy between ¢ and ¢ + dé, may be plotted 
from experimental data. Three typical curves are shown in Fig. 298. 
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The first curve corresponds to an energy band inwhich themaximum 
energy is sharply defined. This signifies that all lower energy levels 
are occupied. The abrupt drop in the curve indicates that the lower 
levels are filled lo capacily (two electrons per level). The second curve 
is Lypical of elevated temperatures. In this case, the edge of the band 
is smeared and the order of magnitude of the width of the smeared 
region is equal to &A7. This means that some of the electrons are in 
an excited stale and can occupy higher levels. The third curve, which 
shows two nonoverlapping bands, is very interesting. The lower 
band is filled and the upper one is partially occupied. A forbidden 
band is located between the two allowed bands. 


269. Electron Gas 


It. is evident from the preceding article that. as far as solid-state 
theory is concerned only the upper energy bands are of interest, since 
electrons at lower levels practically do not take part in interactions 
between atoms. How can the behaviour of upper band electrons be 
described? Since we are dealing with a very large number of electrons. 
it is natural lo use stalistical physics methods and consider an aggre- 
gate of such electrons as a kind of gas. 

The state of each electron of such a gas may be represented by 
a point (py, Py, Pz) in momentum space. The direction of motion of 
an electron is parallel to its radius vector p and the energy of an 
electron depends on ils momentum. In a crystal, the energy of an 
electron will depend on its direction of motion. Let us disregard this 
for a moment and assume that the electrons behave like free parti- 
cles. Despile the fact that this is a rough approximation, i.e., that 
we neglect the potential energy of the field in which the electrons 
move and the inleraction between electrons, the results give us a good 
description, al least qualitatively, of the behaviour of the electrons 
of a solid which form an energy band. 

If the electrons are free, the relationship between their energy and 


; : 1 er eee ee : 
momentum is given by the formula é=7,, P’- This means that in 


momentum space a surface of equal energy isasphere. It is custom- 
ary to call sucha sphere a Fermi sphere, after the famous Italian 
physicist. As indicated in the preceding article @,,.4,., the maximum 
energy of the electrons in a band, may be determined experimental- 
ly. We can say, therefore, that the states of an electron gas are con- 


tained ina sphere of radius Pmax = V 2B max Thus, it would not be 
incorrect to call this Fermi surface a surface of maximum energy. 

To qualitalively check the validity of this theory, let us estimate 
from the value of @nax the number of electrons in a band. We may 
reason as follows. According to the principle of uncertainty, the 
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projection of the momentum of a particle in a metal body of linear 
‘dimension LZ cannot be determined with greater accuracy (han +. 
Therefore, in momentum space, the concept of a point should be 
replaced by the concept of a cell of volume —- where V is the volume 


of the metal body under consideration. One of the basic postulates of 
the theory is that such a cell represents a quantum slate and that it 
can contain no more than two electrons of opposite spin. If there are 


? s “ ‘ iV 

NV electrons in a volume V in the band under consideration, then > 
‘ F N 2} rae a 

-cells are occupied, i.e., the volume op This is the volume of a Fer- 


mi sphere of radius Pmax- Thus, 
4 5 Eee aan i) N 1h 
3h (V 2mE max) = > . 


_ Perfectly reasonable values of N may be obtained using this equa- 
‘tion. This means that the above assumplLions are more or less valid. 


Example. In a metal, the maximum energy, determined experimentally 
is 6max ~ 10 ev = 16 x 10!" erg. Using this value, we obtain Pmax = 


= V2mbmax~ 2x 107-19 gm cm/sec, i.e., the maximum electron velocity 
in a metal is 


: 210719 
ge Pmax : 2x 108 cm/sec. 


Hence, the number of electrons in a unit volume is 


N=2x 42 (VInEmax)® ap ~ 10%, 

The above discussion assumed that the temperature is at absolute 
zero. At a higher temperature, electrons may pass over into momen- 
tum-space cells which correspond to higher energy. Such a transition 
will take place for electrons located in cells close to a Fermi surface 
(otherwise too high a transition energy is required, which is unlikely 
to obtain) and the boundary of the sphere will be broad (not distinct). 
Only at very high temperatures will the excitation affect low-energy 
electrons. As the temperature is increased, the degree of degeneracy of 
the electron gas decreases. An electron gas has a high degree of degen- 
eracy, particularly at low temperatures. The term “degeneracy” 
signifies that different quantum states have one and the same energy. 

The distribution of electrons according to energy at a given temper- 
ature may be calculated. This distribution differs from a Boltz- 
mann distribution. According to the Boltzmann law, at absolute 
zero, the energy of electrons should be equal to zero. From the view- 
point of the new theory, electrons should have a high energy at 
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absolute zero (this follows from the Pauli exclusion principle).* On 
the basis of the Pauli exclusion principle, we can construct a new 
& 
slalislics (Fermi-Dirac statistics), in which the function e *? is 
replaced by the expression 
1 
CEC max 
e kT Ley 


where @max is the maximum possible energy of the electrons at abso- 
lute zero. Multiplying this factor by the electron distribution at 
absolute zero yields the electron distribution at any temperature 


Fig. 299 


Fig. 299 shows the dependence of the Fermi-Dirac function on @ 
when kT = 0.1 and 2.5 ev. 

It should be noted that different particles obey different statistics. 
Molecules obey Boltzmann statistics, photons Bose-Einstein statis- 


. : : . 1 . 
tics, and electrons (and other particles having a spin of 5) Fermi- 


Dirac statistics. 
The difference in statistical approaches consists in the different 
methods of distribution of particles according to their possible states. 
Let us assume that there are two states in which two particles 
may be possibly located. In Boltzmann statistics, where particles 
possess individuality, the following possibilities must be considered: 


* Gmax has an order of magnitude of several clectron-volts, while the av- 
erage energy of thermal motion (k7) is equal to several hundredths of an elec- 
tron-volt. Thus, electrons move rapidly even at absolute zero. The velocity 
of electrons at absolute zero is 1,000 times as great as the velocity of atoms at 
room temperature. This should be re-emphasised in order to make it clear that 
the relationship existing between kinetic energy and temperature in the case 
of molecules is not applicable in the case of electrons. It follows, moreover, 
that an electron gas has negligible thermal capacity. The thermal capacity of 
a body is not affected hy the presence of an electron gas. 
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1) both particles in the first stale; 2) both particles in the second 
state; 3) the first particle in the first stale and the second in the sec- 
ond state; 4) the first particle in the second state and the second 
in the first state. Thus, there are four possibilities in all. 

In Bose-Einstein slalistics, particles are indistinguishable from one 
another. There are therefore three possibilities: 1) both particles in 
the first state; 2) both particles in the second stale; 3) one particle 
in the first stale and one in the other. 

In Fermi-Dirac statistics, the Pauli exclusion principle is taken 
into account: only one particle may be in a given stale. The number 
of possible distributions is reduced to one, i.e., one particle in each 
of the two states. | 

Thus, outer electrons of the atoms of a solid behave like an elec- 
tron gas. This is a very peculiar kind of gas, i.e., ils particles obey 
Fermi-Dirac statistics. 


270. Conductivity 


In the absence of an electric field, the state of an electron gas is 
such that the number of electrons moving from right to left is equal 
to the number moving from left to right. When a field is applied, 
forces are produced which make the electrons move parallel lo the 
field. The distribution of electrons in momentum space becomes 
nonsymmetrical with respect to the origin, i.e., a displacement 
occurs in the direction of the field. An ordered motion which pro- 
duces an electric current is superimposed on the extremely rapid 
random motion of the electrons. 

For the distribution of electrons to be displaced, electrons must, 
to be sure, pass to higher energy states. Such a transition is always 
possible if there are vacancies in the energy band. If the energy 
band is fully occupied, i.e., if all its levels are occupied by elec- 
trons as allowed by the Pauli exclusion principle, the electrons have 
no place to go—in any case, not until they acquire sufficient energy 
to make a transition to the next band. 

Were it not for the overlapping of bands (discussed above), one 
could assume thal all elements having one valence electron are 
conductors and all elements giving up two electrons to be shared 
in the process of forming a solid are insulators. Thus, sodium has 
one electron at the 3s level. In the formation of a body consisting 
of N sodium atoms this level splils into V levels. At each level there 
may be two electrons of opposite spin, i.e., a Lolal of 2N electrons. 
But since we have only N valence electrons, half of the energy band 
is unoccupied. Magnesium, the next element in the Mendeleyev 
periodic table, has two electrons (per atom) at the 3s level. There- 
fore, in the formation of a magnesium crystal, all levels would be 
occupied were it not for the overlapping of energy bands. 
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Investigation of the form of energy bands of various elements 
shows that the above explanation of the origin of conducting prop- 
erties is completely valid. Only when the upper band or the merged 
bands are not fully occupied may the body be classified as a conductor. 

The distribution of the electrons of a conducting body in momen- 
lum space may be displaced in the direction of a field. Since the 
number of electrons moving in the direction of a field does not equal 
the number of electrons moving in the opposite direction, an elec- 
tric current is produced. In an insulator, all the energy bands are 
completely occupied. Ordinary field intensities are incapable of 
creating the forces necessary to transfer electrons to the next higher 
band, and if we persist in trying to transfer electrons of an insula- 
tor to the next band the dielectric breaks down. The electron dis- 
tribution maintains ilssymmetry in momentum space and the num- 
ber of electrons moving to the left remains equal to the number of 
electrons moving lo the right, i.e., no current flows. 

To return to the discussion of conductors, let us now roughly esti- 
mate the magnitude of the electrical conductivity of a body which 
has n free electrons in a unil volume. By free electrons or conduction 
electrons, we mean electrons located in unfilled energy bands. 

Assume that the motion of an electron under the action of an accel- 


3 : . l 
erating force eZ occurs during a small time interval t = —. Here, 


v is the velocity of an electron and J the length of ils mean free path. 
The path is traversed at the extremely high random velocity of 
the electron. The velocity of the ordered motion of the electrons 
crealing the electric current is many orders of magnitude less than 
the random velocity and, therefore, is not included in the denomi- 


. : F . eE : 
nator of the expression for t. Motion with an acceleration a during 
i . : : eE 1 7 
a time interval increases the electron velocily to —-~. Thus, the 


approximate value of the velocity of the ordered motion of the 
‘ : eEl 
electrons creating the current is u ~ aa 
The densily of the electric current is simply the quantily of elec- 
lricily passing through a unil area per unit time, ie., 7 = new. 


Substituting -the above value of u, we obtain 


ne2l 
4 
mv 


Since Ohm’s law in differential form is given by j = oF, a rela- 
tion may be obtained for electrical conductivity: 


ne2l 
mv 
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This gives us only a rough estimation of the electrical conductiv- 
ity. In view of the assumplions made for purposes of simplification. 
calculated and experimental results will differ by as much as an 
order of magnitude. However, we are interested merely in a qualita- 
tive picture. It can be seen that the conductivily is proportional to 
the number of free electrons. This number and the length of the free 
path may vary from substance to substance. 


Example. If the length of the free path of an electron in a metal is 1 ~ 


~ 10 A = 1077 cm and v is of the order of magnitude of 108 cm/sec (see the 
example on p. 682), the free path is traversed in a time t = 107! sec. 
Assume that the voltage drop along a 1-cm segment of a metal conductor 
with a 1-cm? cross-section is equal to 0.003 v = 10-5 Gaussian unit. Then, 
E = 107-5 Gaussian unit and the velocity of the ordered motion of an electron 


is u aieeh 5 x 10-3 cm/sec. The current density is 7 = neu ~ 1073 x 
mv . 


* 4.8 * 10730 x 5 x 1073 ~ 30 * 10'!0 Gaussian unit= 100 a/em?. This 
yields quite reasonable values of conductivity: 


o ~ 25 x 10!5 Gaussian unil = 28 x 104 ohm-! cm-!. 


If a crystal had an ideal lattice and the temperature approached 
absolute zero, there would he no restriction on the length of the free 
path and the material would have no electrical resistance. The elec- 
tron range is limited by atomic thermal vibrations and the presence 
of various crystal imperfections. Both factors disturb the ideal peri- 
odicity of the field in which an electron moves and result in the 
scattering of electrons. It follows that the conductivity of a body 
improves as its temperature decreases and approaches a limit which 
is determined by the degree of perfection of the crystal lattice. 

It can be shown experimentally that the resistance of a metal de- 
creases with temperature. This would indicate that the theory is val- 
id for metals. Moreover, the fact that electrical resistance decreases 
with temperature is an essential characteristic of metals. The plas- 
tic deformation of a metal, the impairment of its lattice by nuclear 
bombardment, and in general any action serving to damage the lat- 
tice will reduce the length of the free path and therefore result in 
increasing the electrical resistance. 

In Part I (p. 222), the thermal conductivity of gases was discussed. 
It was shown that the thermal conductivity of a gas is propor- 
tional to the length of the free path and is given by the formula 
% ~ pvlep. Is this formula useful for the calculation of the thermal 
conductivity of metals? Electrons are much lighter than atoms and 
one is justified in assuming that heat is transmitted by electrons 
which transfer energy from one atom to another. Since the length of 
the free path is not known, one cannot calculate the coefficient of 
thermal conductivity. However, it should be noted that the ratio 
of the coefficient of electrical conductivity to the coefficient of ther- 
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mal conductivity does not contain unknown parameters and depends 
only on universal constants and temperature: 

Za 
—=const T 
0 


(Wiedemann-Frans formula). Experimental resulls agree fairly 
closely with the value obtained by means of this formula. The follow- 


ing table gives the values of the quantity a at O°C for a number 
of metals. 


Metal Au Zn 


wxXohm 


x 108 = 2.31 | 2.35 | 2.23 | 2.61 | 2.47 | 2.51 | 2.52 | 2.31 
of deg? : 


The theoretical value of this quantity is equal to 2.45 x 10°. 
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Since a crystal always has a considerable number of imperfections, 
it will usually possess a residual resistance, which is reached at 
a temperature of several degrees Kelvin, i.e., below this point the 
resistance does not decrease with temperature. However, there exist 
about ten metals which behave quite differently. At definite tempera- 
tures close to absolute zero, such metals completely lose their 
electrical resistance. When an electric current is induced in such 
a superconductor, the current will flow in the circuit for days. This 
shows that the resistance has not simply decreased, but has dropped 
abruptly to zero. 

Of the pure metals, niobium has the highest temperature (9°K) 
and hafnium the lowest (0.3°K) at which superconductory proper- 
ties appear. 

It might seem that superconductivity is a property common to 
all metals, i.e., if the temperature is reduced sufficiently super- 
conducting properties will appear. This is apparently not so. The 
temperature of numerous materials has been reduced down to 0.03°K 
without superconducting properties being manifested. The suppo- 
Sition that such properties are not universal is supported by the fact 
that superconducting metals occupy a definite part (the middle) 
of the Mendeleyev periodic table. 

Superconducting materials include, in addition to pure metals, 
numerous alloys of such nonsuperconducting metals. Moreover, 
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a chemical compound may be a superconductor even though neither 
of its components is one. Thus, copper sulphide is a superconductor, 
but copper and sulphur are not. Niobium nitride already reveals 
superconducting properties at 30° above absolute zero. 

The disappearance of electrical resistance al a temperature 7, is 
not the only peculiarity of superconductors. ; 

Another mark of a superconductor is its characteristic behaviour 
in a magnetic field: generally speaking, a magnetic field penetrates 
such a conductor to a depth of only about 1,000A. Tf very thin films, 
the behaviour of which is somewhat peculiar, are left out of consid- 
eration, we may make the following generalisation: the magnetic 
field inside a superconductor equals zero. 

However, this is true only as long as the applied external field 
does not exceed a certain critical value H,. When this value is ex- 
ceeded, the superconducting state disappears—the magnetic field 
penetrates the material and electrical resistance is restored. 

H,, is a function of temperature, i.e., it is not a constant. At the 
temperature 7;,, a very weak external field suffices to destroy the 
superconducting state. Generally speaking al 7’ = 7, the critical 
intensity H, equals zero. H, gradually increases as the temperature 
is decreased, and at absolute zero it reaches its highest valuc. For 
example, the maximum value of the critical field inlensily of mer- 
cury (T;, = 4.2°K) is 412 oersted. 

Electrical resistance is due to the scatlering of electrons by the 
thermal waves of atoms in a crystal lattice. These thermal wayes 
exist, as we know, because of the presence of a zero energy, even at 
absolute zero. It would seem, therefore, that electrical resistance 
should not disappear no matter how much the temperature is 
decreased. 

How is it possible then to have thermal scattering of electrons 
and no resistance to electric current? This problem was nol solved 
until 1957 when it was proved by means of quantum mechanics that 
electrons in a thin energy layer next to a Fermi surface are able to 
become “paired” thanks to interaction with the thermal vibrations 
of a crystal lattice. 

It transpired that at low temperatures it is advantageous from 
the energy viewpoint for two electrons of equal spin magni- 
tude but opposite spin direction lo become “united”. The words 
“paired” and “united” have been placed in quotation marks 
because calculations indicale that the wave functions of these elec- 
trons extend over a large distance, viz., of the order of 10-4 cm (the 
size of a crystal grain in an ordinary polycrystalline metal). There- 
fore, the formed pairs should not be viewed as peculiar “molecules”, 
since the bond is implemented over a large distance by means of 
thermal waves. 
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lf follows from the theory that all electron pairs are identical in 
the sense that they have the same total momentum. : 

“Matter” consisting of such electron pairs possesses superconduction 
propertics. The formation of electron pairs does not eliminate the 
thermal scattering of electrons. Superconduction occurs because the 
scattering of the electrons of a pair ceases to alfect the current 
strength. Thermal scattering can only break up one or another pair or 
form a new pair from separate electrons, but the magnitude of the 
current is determined by the total momentum of the electrons, 
which remains unchanged. Thus, according to this model, the ther- 
mal sealtering of electrons may produce electric current fluctuations, 
bul it cannol stop the current. 

A superconductor contains, in addition to “paired” electrons, an 
ordinary electron gas, i.e., a gas of individual electrons. Thus, in 
a superconductor, there exist, so to speak, two fluids—an_ ordinary 
fluid and a superconducting one. If the temperature of a superconduc- 
tor begins to rise from absolute zero, thermal motion will break 
up more and more pairs of electrons, i.e., the ratio of the ordinary 
electron gas Lo the superconducting electron gas will increase. Final- 
ly, the critical temperature is reached and the last electron pairs 
break up. 

The new theory provides a quantitative explanation of all of the 
superconduction phenomena discussed above. 


. 
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Properties. A large group of substances (various elements and 
chemical compounds), the conductivity of which lies in the broad 
interval between those of conductors and insulators, are classified 
under the heading of semiconductors. Since a voltage of 1 volt will 
produce currents of several hundred thousand amperes in a cube of 
metal having a volume of 1 cm?, and currents of the order of 10-1 
amp in insulators under the same conditions, one can see that the inter- 
val occupied by semiconductors is extremely large. 

The conductivily of substances in this interval has a number of 
peculiarities which enable us to “recognise” semiconductors. 

First, it should be noted that the dependence of conductivity on 
temperature is opposite to that of metals. The conductivity of semi- 
conductors, in contradistinction to that of metals, may decrease 
rapidly with temperature. At low temperatures, a semiconductor 
may become an insulator. The resistance of most semiconductors 
is considerably more sensitive to changes in temperature than met- 
als. Compact temperature meters of high sensitivity may be con- 
structed using semiconducting thermal resistors (thermistors). 


44—1409 
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A second important feature of semiconductors is that in a number 
of cases they may possess positive (p) as well as negative (m) conduc- 
tivity. The terms positive and negative are used in the following 
sense: if the current is due to the motion of positive charges the con- 
ductivity is called positive and if it is due to the motion of negative 
charges it is called negative. Thus, metals have negative conductiv- 
ity since the current is due to the motion of electrons. Both kinds 
of conductivity occur in semiconductors. This effect seemed strange 
at first since the flow of 
current in a semiconduc- 
tor is not associated (as 
in an electrolyte) with the 
displacement of ions, and 
the question of the na- 
ture of positive carriers of 
current remained open for 
a period of time. 

The sign of the current 
carrier may be deter- 
mined in a number of 
ways. Let us examine the 
most convincing evidence, 
which is based on a study 
of the forces exerted by 
a magnetic field on current-carrying particles (Hall effect). If an 
electric current flows along a plate which is placed perpendicular to 
magnetic lines of force then, on a charged particle e moving with a 
velocity u, a force F will be exerted in a direction perpendicular 
to the field and current (see Fig. 300). In other words, an electric 
field of intensity E = uB will be created in such a direction (see p. 297). 

A potential difference U = uBd is produced between the plate 
faces perpendicular to the created field. The sign of this potential 
difference is determined by the sign of the charge carrier. 


Fig. 300 


Example. Consider a 1 X 2 X 0.5 cm® semiconducting ne in a magnetic 
field B = 1,000 gausses. Assume that the conductivity o of the plate is equal 
to 3 ohm™ cm™ (zine oxide). If a potential difference of 1 volt is applied be- 
tween the plate faces which are separated by a distance of 2.cm, an electric 


current of density j = oH = 3 X + = 1.5 a/cm? will flow through the plate. 


Experiments show that a potential difference U = 0.12 mv is produced 
between the lateral surfaces of the plate. The sign of the Hall effect (see 
Fig. 300) indicates that the charge carriers are electrons. The velocity of their 
ordered motion is 

: 3 
pile meme 0D SORE We) m/sec = 12 cm/sec. 


Bd 0.4 Vv sec 5 10-2m 


m2 


iS) 
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Note that this velocity is more than 1,000 times as great as the velocity of the 
ordered motion of conduction electrons in a metal (see the example on p. 686). 
The number of conduction electrons per unit volume of semiconductor is 


tN Salem? gy. jo7 ems 
au 1.6% 10-9 Cx 12 cm/sec =S x doreon 


The low value of o is due to the fact that this n is about a millionth of that 
of a metal. 


A final important feature of semiconductors is their extreme sen- 
silivily to impurities, which not only affect their conductivity (an 
impurity of the order of one per cent may change the conductivity 
of a semiconductor by a factor of a million or more), but may 
change n-conductivity into p-conductivity and vice versa. 

The most important semiconductors already of practical signifi- 
cance include germanium, silicon, selenium, antimony alloys (con- 
taining indium, cadmium and zinc), and copper and titanium oxides. 

Interpretation of Properties. Most characteristics of semiconduc- 
tors can be easily explained by means of an energy level diagram- 
Insulators have a filled energy band. The next unfilled band is sep- 
arated from the filled band by a large energy gap. Imagine that the 
system of levels of a substance is such that the gap between these 
bands decreases and the energy of thermal motion suffices to transfer 
electrons from the filled band to the unfilled band. Such a substance 
will act as a natural semiconductor. 

At a given temperature, the number of electrons in the upper band 
will be determined by the dynamic equilibrium conditions estab- 
lished between bands. Electrons continuously pass from the lower 
band to an excited state and vice versa. Just as in the case of a sat- 
urated vapour, equilibrium will prevail when the number of elec- 
trons moving “upwards” equals the number of electrons moving 
“downwards”. 

Again as in the case of a saturated vapour, when the temperature 
is raised the equilibrium is displaced in the direction of the upper 
level, i.e., the instantaneous concentration of electrons in the upper 
band increases. The concentration of free electrons rises sharply 
with the increase in gap between bands. The probability of sur- 
mounting an energy barrier of width A@ invariably appears as an 
exponential factor. The approximate concentration of electrons in 
the upper band at a temperature 7 may be determined from the 

sé 
formula n ~ 10% x e 2AT, 

If a body has a gap A@ which is significantly greater than k7, it 
belongs under the heading of insulators. For this purpose, it is suf- 
ficient for ¢ to be 100-200 times as great as kT. At room tempera- 


44* 
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ture, kT me ap ev. When A@ becomes less than 1 ev, i.e., about 
40 times as great as k7', the number of electrons in the upper band 
will be sufficient to create measurable currents. When A@_ is of 
the order of several tenths of an electron-voll, a semiconductor pos- 
sesses considerable conductivity. 

Reference to the conductivity formula o = at shows that when 
the temperature of a semiconductor changes two factors on which 
o depends also change. In the first place, as the temperature increa- 

ses, the number of free electrons 
<— Field n increases, but, as previously, 
—> Movement of electrons the free range 1 decreases with 
increasing temperature. However, 
experiments show that usually 
the former effect overshadows the 
latter. 
Until now, we have discussed 
the conduction properties of the 
b) upper band, but have disregarded 
the lower band, which should 
also acquire conduction proper- 
ties since vacancies are formed in 
it as the result of the transition of electrons to the upper band. This 
conductivity may be very peculiar in nature. 

The creation of conductivity in the upper, partially filled band 
may be interpreted as a displacement in the distribution of electrons 
in momentum space in the direction of the field (to the right in 
Fig. 301a). However, this is not the only way in which ordered motion 
of electrons may occur. Imagine that the overall shape of the 
distribution of electrons does not change (see Fig. 301b). However, 
now one electron, and now another, close to the Fermi surface is 
snatched away and a “hole” is formed in momentum space. Under 
the action of the field, such a hole is immediately filled by a neigh- 
bouring electron moving from left to right (in the same direction as in 
the other diagram). The hole is displaced from right to left. Now, 
another point representing an electron in momentum space occupies 
this position and in this manner-the hole moves in the opposite 
direction to that of the electrons. Since holes are formed continually, 
a continuous positive “hole” current flows. 

Thus, in a natural semiconductor, electric current may be viewed 
as the result of the motion of “holes” in the occupied band as well 
as of electrons in the upper band. However, the major role in such 
ses is played by the motion of electrons in the conduction 

and. 


Fig. 301 
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This natural conduction of semiconductors is encountered consid- 
erably less frequently than another phenomenon, namely, semi- 
conduction properties under the action of a small percentage of 
impurities. The role of foreign atoms or other lattice imperfections 
consists in their contribution to the system of energy levels. Fre- 
quently, imperfections create their own level—a narrow energy band 
between the filled and unfilled bands. 

Let us assume that the foreign atoms contribute “surplus” elec- 
trons which occupy a narrow band between the main bands. When the 
temperature increases, 
electrons pass from the 
level of the impuri- 
ties to the conduction 
band in greater and great- 
ernumbers, i.e., conduc- 
tion increases. Such a 
semiconductor yields 7- 
conduction. A point may 
be reached (for a low per- 
centage of impurities) at 
which all surplus elec- 
trons are given up. A fur- 
ther increase in tempera- 
ture will not result in an 
increase in conduction 
and from then on the 
body will act like a met- 
al. Such behaviour may 
be detected when penta- Fig. 302 
valent arsenic or phos- 
phorus atoms are introduced into a lattice of quadrivalent silicon 
or germanium. Fig. 302 shows a simplified diagram of the crystal 
lattice of silicon. If a silicon atom is replaced by an arsenic atom, a 
“surplus” electron is obtained. This will be a conduction electron. 

It is interesting that impurities may result in p-conduction. This 
occurs when an impurity atom has acceptor properties, i.e., can 
attract electrons. Electrons pass from the filled band to the inter- 
mediate level of the impurities and as a result hole conduction occurs 
in the filled band. Such conduction occurs in silicon containing 
a trivalent aluminium impurity. If at a number of lattice sites 
Silicon is replaced by aluminium, “electron-deficient sites” will 
exist in the crystal. When a field is applied, an aluminium atom may 
attract an electron from a neighbouring silicon atom; the electron 
falls under the action of the electric field and a “hole” moves in the 
Opposite direction. 


- 
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It should be noted that such “unsophisticated” models of the dis- 
placement of an electron are greatly oversimplified in veiw of the 
fact that the motion of electrons in a solid satisfies the laws of quan- 
tum mechanics. 

By adding one or another impurity, we are able to vary, the con- 
ductivity of materials within very broad limits. We may change 
p-type conductivity into n-type conductivity and may significantly 
change the nature of the temperature dependence of the conductivity. 
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Work Function of an Electron. Electrons in a conduction band 
behave like an electron gas. The surface of a solid acts as the “walls” 
of the vessel in which this gas is 
zocated. To leave the bounds of 
this surface, an electron must 
surmount a potential barrier the 
height of which is designated by 
G6. At absolute zero, electrons 
have a limiting energy W. In the 
model of an electron gas, W cor- 
responds to a Fermi surface. This 
is the energy of electrons which 
at absolute zero are located at 
the highest level. Thus, in order for an electron to surmount the 
potential barrier, it is not necessary to impart to it an energy @; 
it is sufficient to give it an additional energy 

A=€@—W. 
The quantity A is called the work function and 4 =q the potential 


function, i.e., the work function expressed in volts (see Fig. 303). 

An electron’s escape from a metal is impeded by the forces of 
attraction exerted by positive ions as well as the force of attraction 
between the electron and its electrical image. The latter force is 


Fig. 303 


2 
equal to As , where z is the distance of the electron from the surface. 


This force is capable of holding an electron at a considerable distance 
from the surface and thus forming a layer or electron cloud close to 
the surface of the body. 

If the metal is located in an electric field, the work function de- 
creases by an amount eV eE, where ¢ is the electron charge and E is 
the field intensity. The intensity of the external field must equal 
0.2 X 10° v/cm if the potential function is to decrease by 1v. (Thus, 
in ordinary electronic instruments the external field has little effect 
on the work function.) 
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The work function is very sensitive to changes in surface properties. 
It transpired that it is possible to deposit electrically positive atoms 
or ions (metals, oxygen) on the surface of a cathode. In this manner, 
a layer of positive charge may form on such a surface. Thus, as may 
be seen from the following example, the work function can be con- 
siderably reduced. : 

The filament of the heater in electron tubes is usually made of 
tungsten (work function A = 4.9 ev). By coating the surface: of the 
tungsten with a layer of an oxide 
of an alkaline earth metal (Ca, 
Ba, Sr), we may decrease the work 
function to 1.5-2 ev. This enables 
us to obtain considerably greater 
emission at lower filament tem- 
peratures. 

Thermionic Emission. The escape 
of electrons from a metal upon 
heating is known as thermionic emis- 
sion. This phenomenon is basic to 
the action of heater-cathode tubes. 
When the temperature is raised, 
electrons are excited, some of them 
acquring a sufficient velocity in the 
direction perpendicular to the sur- 
face of the material to surmount the potential barrier @. 

An electron gas obeys Fermi-Dirac statistics, according to which 
the number of electrons having an energy @ is proportional to the 


Fig. 304 


: 4 . A F ‘ 
expression —g-y— - But we are interested in the energies @ which 
e kT { 
are considerably greater than the zero-level energy W. Therefore, it 
is accurate enough to reduce the above expression to 


_W-G -% 
aie: een ee 


Thus, we may determine the number of electrons having an energy 
equal to the height of the potential well. It may be rigorously proved 
that the thermionic emission current is proportional to this expres- 
sion. We see from the formula that thermionic current increases ex- 
tremely rapidly with temperature. 

The circuit shown in Fig. 304 may be used to measure the thermionic 
current. By increasing the voltage, we quickly reach saturation 
current. (This is the thermionic current referred to above.) The ini- 
tial portion of the current-voltage curve is space-charge limited 
(see above). The voltage draws electrons to the plate from the elec- 
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tron cloud and the cathode emits enough electrons to maintain the 
cloud in equilibrium. In the absence of an external voltage, this 
equilibrium is determined by the electron emission of the cathode 
and the negative potential of the cloud. As the voltage is increased, 
the electron cloud begins to dissipate and emission increases until 
the voltage draws all electrons of the cloud to the plate. At this 
point current saturation sets in. 


Rigorous analysis yields the following relationship hetween electronic 
current and temperature: 


= 6@" 
j=AT2e "T (Richardson formula). 


For tungsten, A = 75 a/cm? deg? and ep = 4.5 ev. Let us compare the 
ase soy of thermionic emission form tungsten at 500°K and 2,000°K . 
t ’ 


_ 4.5x1.6x 10712 
: —-16 
j= 75 x 25 x 10! xe 1°88x10°%XK500 _ 49-36 a ome, 


In other words, to obtain measurable currents, cathodes of impracticable 


dimensions (greater than that of the entire land mass of the globe) would be 
required. 


At 2,000°K, 
_ 5.21x104 
7=7T5X4X108xe 2000 4g mayem?. 

Such a current is easily measured, but the area of the emitti f is sti 
too large for most practical purposes. ats saa ais 

The picture changes when tungsten is coated with caesium. Now, A = 
= 3.2 a/cm*deg? but ep = 1.36 ev. At T= 2,000°K , 

_ 1.57x104 
T=3.2X4X108xe = 2000 48 x 103 a/em?. 

Clearly, such current densities would destroy the cathode. Therefore, the 


required values of current density, 7 <1 a/cm?, are attained at lower tem- 
peratures (~1,300°K). 


Secondary Emission. This refers to emission due to the dislodge- 
ment of electrons from a metal under the action of other electrons. 
Secondary electrons may emerge, taking the same direction as the 
primary electrons. This shows that primary electrons interact with 
bound electrons; otherwise, the law of conservation of momentum 
would be violated. Secondary emission begins at a primary electron 
energy of the order of 10 ev. Most secondary electrons have an energy 
of several electron-volts—their energy distribution is practically 
independent of the energy of the primary electrons. . 

Primary electrons produce secondary electrons and, in addition, 
are elastically scattered. The remarkable phenomenon of one pri- 
mary electron producing several secondary electrons has wide appli- 
cation (e.g., Kubetsky tube). 
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Extrinsic Photoeffect. This phenomenon may be studied by mak- 
ing the cathode of a vacuum tube out of the material under inves- 
tigation. Light falling on the cathode dislodges electrons from the 
material. Electrons reaching the anode generate a_ photoelectric 
current, the magnilude of which may be studied as a function of 
external conditions. 

It is the total current taken from the cathode, of course, that is 
characteristic of the substance. Here too, therefore, we usually oper- 
ate under conditions of saturation current. If no voltage is applied 
across the photocell, a weak current generated by electrons leaving 
the cathode in the direction of the anode flows through the instru- 
ment. A weak accelerating voltage is not sufficient to attract all the 
electrons, but at a certain voltage all the electrons reach the anode 
and saturation current is obtained. : 

Experiments show that the photocurrent is strictly proportional 
to the intensity of the incident light. This is true of light of any 
frequency which produces a_photoeffect. 

Moreover, the number of dislodged electrons is exactly equal to 
the number of photons. One photon may dislodge only one electron. 
It is not possible for a photon to dislodge several electrons from 
a substance by a series of energy losses. This important postulate is 
somewhat difficult to prove in a study of the extrinsic photoeffect, 
since the extrinsic photoeffect may be accompanied by an intrinsic 
photoeffect (see below) in which some of the electrons do not leave 
the bounds of the substance. 

The law of conservation of energy and the law of conservation of 
momentum are obeyed in the interaction of a photon and an elec- 
tron. The law of conservation of energy (the Einstein equation) takes 
the form 

mov2 


hv = 5 +e, 


where ¢ is the potential function of the electron for the metal (the 
same as in the thermionic emission experiments). In accordance with 
the law of conservation of momentum, it may be assumed that the 
lattice of the metal takes part in the photoelectron interaction proc- 
ess (otherwise electrons could move only in the same direction as 
the photons). 

A photon may produce a photoeffect if its energy is not less than 
the work function. It follows that each material has a photoeffect 


limit. The limiting frequency vo is equal to —9 and the limiting 
wavelength Ao (in millimicrons)—the “red” limit of the photoeffect— 
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ep ; “ : 
take place when a substance is irradiated with light of long wave- 


length. The “red” limit of the photoeffect in the case of Li is Ay = 
= 560 millimicrons (5,600 A), i.e., it is in the yellow region of the 


visible spectrum; in the case of Cu, 49 = 300 millimicrons (3,000 A), 

i.e., the ultraviolet region; and in 

Vy the case of Al, Ao = 410 millimic- 

rons (4,100 A), i.e., the violet re- 

gion of the visible spectrum. 

" If the energy of a photon is great- 

arctan © er than the work function, the 

surplus goes into the kinetic energy 

of the electron. Thus, hard radia- 

tion can produce very fast photoe- 
y ree 

: *'In order to measure exactly the 

Fig. 305 limiting frequency and the work 

function, we use a retarding poten- 

tial method. A small bias voltage is applied to the photocell (plus 

terminal is connected to the photocathode) and this voltage is in- 

creased until the current cuts off. This point is reached when 

Vea. In this manner, we may determine experimentally the de- 
pendence of V; on the frequency of light: 


, where » is expressed in volts. No photoceffect will 


hv 
Vp = Se 


The plotted curve is a straight line the slope of which is equal to 
the universal constant ~~. The limiting frequency vo and the poten- 
tial function @ are obtained as intercepts along the abscissa and ordi- 
nate, respectively (see Fig. 305). 

Ezample. If soft X-rays of wavelength 4 = 100 A fall on a copper plate 
(p = 4.1 v), the bias voltage cuts off the photocurrent when 
3 x 1010 1 

0-5 * Tex 10-12 
Therefore, the bias voltage will equal 120 v. 


eV, =hv—ep=6.6 x 10-2? — 4.1 ev= 120 ev. 


Another important characteristic of a photocathode material is 
the spectral dependence of the photocathode. Here, no simple rela- 
tionship may be applied. The curve begins at the limiting frequency 
and in many cases increases rather uniformly; one can say that the 
coefficient of utilisation of photons increases with photon energy. 
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However, in other cases, the spectral curves have well defined maxima 
which lie within a rather narrow spectral band. 

Photocells which utilise the extrinsic photoeffect have broad appli- 
cation. They are used in photo relays, television and cinema sound 
tracks. Silver, caesium and potassium may serve as photocathodes; 
antimony-caesium cathodes are widely employed. 

In various photo relay applications, the photocurrent need not be 
proportional to the intensity of light, but the sensitivity of the 
photocell should be high. In such cases, gas photocells instead of 
vacuum photocells may be used. This increases the sensitivity tens 
of times. 

Intrinsic Photoeffeet. When the action of a photon results in the 
displacement of an electron from a filled band to a level of an impu- 
rity or to a conduction level, it is referred to as the intrinsic photo- 
effect. Under the action of light, this phenomenon may produce con- 
duction electrons and holes in a body. Such conduction electrons and 
holes will occur in pairs. Strictly speaking there will be a pair of 
charges for each photon. The phenomenon is extremely complicated 
by secondary processes which occur within a body as a consequence 
of the recombination of electrons and holes. 

It is clear, therefore, that the intrinsic photoeffect is a phenomenon 
which is particularly. characteristic of semiconductors, but which 
may also occur in insulators. 

Semiconductors which possess this effect are included in current 
circuits as photo resistors. In the dark, such a body has very low 
(dark) conduction. Its conduction increases when subjected to light. 
Energies of several tenths of an electron-volt may be sufficient to 
produce intrinsic electron transitions. Therefore, the threshold of 
the intrinsic photoeffect may lie in the far infrared region. 

Photo resistors are widely used in signal systems and in automation 
when it is necessary to amplify or detect very small changes in 
light intensity. 
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In a number of cases, the junction between a metal and a semicon- 
ductor or between two semiconductors may have a rectifying action. 
Even though the boundary between two bodies in close contact (weld- 
ed or fused) is very narrow, nevertheless it is of finite width; hence 
the designation barrier layer. Such a layer may form at the junction 
between copper and cuprous oxide or at the junction between sele- 
nium and cadmium selenide. Investigations indicate that a barrier 
layer between two semiconductors is formed when one of the semi- 
conductors is a p-type conductor and the other an n-type. Such 
barrier layers are called p-n junctions. 
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Fig. 306 illustrates the rectification provided by harrier layers. 
The figure shows a typical current-vollage curve. The right branch 
of the curve is the characteristic for the forward current and the 
left branch for the reverse current. The forward current increases 
rapidly with increasing voltage, but the reverse current remains 
almost constant and has a very low value. 

Which is the direction of forward current? Investigations indi- 
cate that at a p-n junction the forward current flows from the p-type 

semiconductor, through the junction, 

#1 to the n-type semiconductor. This 
means that holes move toward elec- 
trons and electrons move toward 
the higher concentration of holes. 
Reverse current flows when the 
holes and electrons move away from 
the junction. 

In the case of a metal-semicon- 
ductor junction, the situation may 
be described as follows. If the metal 

-V *V forms a junction with a p-type sem- 
=f iconductor (copper-cuprous oxide), 
the forward current will flow from 

Fig. 306 the cuprous oxide to the copper. 

This is to be expected: There are 

no free electrons in a semiconductor, but in a metal there are 
an excess of such electrons; therefore, electrons move from the metal 
to the semiconductor. 

The characteristics of barrier layers find wide application in indus- 
trial rectifiers. Copper-oxide rectifiers (copper-cuprous oxide) and 
selenium rectifiers have been produced for a long time. During recent 
years, tiny germanium and -silicon rectifiers—crystal diodes—have 
been widely introduced. The introduction of impurities into ger- 
manium or silicon may transform them into p-type or n-type con- 
ductors. 

A crystal diode consists of a very small germanium (or silicon) 
crystal, one part of which contains an acceptor-type impurity and 
the other a donor-type impurity. 

Also of interest are crystal triodes, representing a semiconductor 
system of the p-n-p or n-p-n type. If a wire is soldered to each of the 
three sections of such a tiny triode (the dimensions of crystal “tubes” 
are of the order of a centimetre), the system may be connected in 
a circuit just as an ordinary triode tube. A voltage is applied across 
the two outer ends, one end serving as the anode and the other as 
the cathode. The third tap serves as the grid. Such a system of semi- 
conductors has two barrier layers, connected in opposition. This 
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is why it behaves like a triode tube. The analogous operation of 
crystal triodes and ordinary triodes is illustrated in Fig. 307. 
Another important use of barrier layers is in the manufacture of 
photocells, which operate without a voltage source. Coating a semi- 
conductor with a thin ‘layer of metal will produce a barrier layer. 
A layer of metal may be made so thin that light easily passes through 
it. When light passes through the metal, an intrinsic photoeffect 
is produced in the semiconductor. The presence of a barrier layer 


Collector 
\ 


Vacuum triode 


Semiconductor 
triode 
Fig. 307 


causes the liberated electrons to move in a definite direction. An 
electric current will flow when the circuit is closed. 

Copper oxide and selenium photocells are manufactured on the 
basis of this principle. Sulphur-thallium photocells, having a short- 
circuit current of the order of 10,000 microamperes per lumen, are 
at present being used in the Soviet Union. They have an efficiency 
of transformation of light energy into electrical energy of the order 
of 4 per cent. Here too silicon and germanium p-n barrier layers 
are of fundamental significance. They enable us to manufacture 
photocells with an efficiency of the order of 10 per cent. This new 
discovery has placed the utilisation of solar energy on a _ practi- 
cal basis. 


276. Contact Potential 


When two metals or semiconductors are in contact, there arises 
a difference of potential between them. This difference is known as 
contact potential. To measure this difference, we must remove 
inclusions, oxide films, etc., and make close contact between the 
surfaces of two such bodies by soldering, welding or pregrinding. 
In this manner, we may form a circuit that is broken in one place. 
Since all points of the body are at the same potential, the contact 
potential may be determined by measuring the field in the gap. 
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The order of magnitude of the contact potential between two bod- 
ies is equal to several tenths of a volt. 

The existence of contact potential is easily explained. It is due 
to the difference in work functions of the two adjacent bodies. It 
should be recalled that the work function A’ is equal to the differ- 
ence between the energy W of an electron at the highest level inside 
a metal (at absolute zero) and the energy of an electron escaping 
from the metal with zero kinetic energy. 

The energy distribution of electrons in the case of two solids 
placed in contact can be represented as shown in Fig. 308. The 
upper level is the same for all 
bodies. The electrons of the body 
having a lower work function 

are at a higher energy level. 
Thus, the conditions are present 
.for the transition of electrons 
from the first body to the second. 
This transition is accompanied by 
the formation of positive charge on the first body and negative charge 
on the second. At the point of contact, there arises an electric 
field which impedes the transition of electrons. Finally, equilib- 
rium will be established at a particular potential difference char- 
acteristic of the given pair of metals. 

This picture depends little on temperature. As the temperature is 
raised, the energy distribution boundary of the electrons is no long- 
er so sharply defined. Electrons appear at higher levels, but the 
conditions for the transition of electrons remain basically the same 
thanks to a lack of close dependence of the energy of the electrons 
on temperature. 

It is evident from the above description of this phenomenon that 
any group of solids may be arranged in a definite sequence such that 
each member of the sequence becomes positively charged with 
respect to the next. Such a series was first obtained by Volta, the 
discoverer of contact potential. From the above explanation of 
this phenomenon, it is clear that a Volta series corresponds to rising 
work functions, i.e., the motion of electrons between two bodies 
is in the direction of the one having a higher work function. 

Since the contact potential g,z between two bodies may be ex- 
pressed in the form of a difference of work functions, 


Fig. 308 


1 
P12 = = (Ai —A2), 


it is evident that the potential difference between two bodies may 
be expressed as the difference of the contact potentials between 
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each of these bodies and a third: 


4 1 
ars (Ay—A2), Vis =e (Ay — As); 


1 
De aaa (Ao — As) = Pai — Pai- 


Furthermore, it is evident that in a closed circuit consisting of 
any number of bodies the total contact potential is equal to zero: 
Pi2 + P23 + Pai = O. No current will flow in such a circuit. 


Electromotive Series 
(normal potential in an electrolyte solution, volts) 


Li Ca Na_ Al Zn Fe Ni Pbh Cu Hg Ag_ Pt 
—3.01 —2.84 —2.71 —1.66 —0.76 —0.44 —0.23 —0.12 +0.34 +0.70 +0.80 41.2 


277. Charge Distribution in a Nonuniformly 
Heated Body 


Let us consider a rod along which a drop in temperature occurs. 
Different portions of the rod will be subject to different conditions, 
and this will affect the behaviour of the free electric charges. Where 
the temperature is higher the charges will have greater energy; 
moreover, the number of free charges may increase if electrons can 
pass from the filled band to the conduction band. Both effects tend 
to produce diffusion of free charges, which continues untill a field 
which counterbalances the tendency to uniform distribution is creat- 
ed. A drop in potential will occur along the rod; negative charge is 
formed at one end of the rod and positive charge at the other. Each 
body has its own characteristic curve of potential drop as a function 
of temperature. The rate of fall of potential may be described by the 
derivative of the potential with respect to temperature: 


If a constant temperature difference is maintained between the 
ends of a rod, heat is transferred continuously through the rod. 
Heat is transferred by free charges, but current cannot flow in an 
open circuit. The continuous transfer of energy without the transfer 
of charge is achieved thanks to the different velocity of the charges 
moving from the hot end to the cold end, while the number of charges 
passing through a given cross-section per unit time is the same in 
each direction. If electrons are the carriers of current, an excess con- 
centration of them is produced at the cold end of the rod. If positive 
particles or holes are the carriers of current, positive charge accumu- 
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lates at the cold end. Thus, the sign of the potential difference will 
differ, depending on the sign of the current carriers. 

Will the above effect obtain in the case of a semiconductor, partic- 
ularly one in which holes as well as electrons are the carriers of 
current? As a matter of fact, such bilateral diffusion may reduce to 
zero the potential difference of a nonuniformly heated body. However, 
the potential differences formed by posilive and negative current 
carriers may not balance each other. This may occur as a result of 
a difference in mobility between electrons and holes, and also as 
a result of differences in their concentrations. 

Certain difficulties occur in detecting potential differences in 
a nonuniformly heated conductor. The order of magnitude of such 
a potential drop is 10-* v/deg. This effect cannot be detected, of 
course, by forming a closed circuil of the conductor in the hope of 
measuring electric current. Such a closed circuit may be conceived 
of as divided into two halves: in one half a potential drop occurs 
and in the other a potential rise. In a uniform conductor, the mag- 
nitudes of these two potentials will be exactly equal; hence the 
emf we wish to measure will not be detected. 


278. Thermoelectromotive Force 


Electric current will flow in a wire ring consisting of two (or more) 
different materials if the junctions have different temperatures. 
This is the well-known thermoelectric effect, which has found broad 
practical application. 

There are two possible reasons for the flow of a thermoelectric 
current. First, it is evident that the potential drops along the two 
wires due to temperature drop may differ if the values of the constant 


d : : 
a= oe differ for the two materials (we shall designate them as 


[ and II). 
T2 Ty 
Thus, the potential differences \ a dT and an dT generally 


T Ts 

are not equal. This alone would be sufficient for an emf equal to- 
the difference between these voltages to arise in the wire ring. 

The second reason for thermoelectric current lies in the fact that 
contact potential quite probably depends on temperature. If the 
two junctions are placed at different temperatures, their contact 
potentials may differ. Again, this condition alone would be suffi- 
cient for a net potential difference to exist in the closed circuit and, 
hence, for a current to flow. 

Taking both phenomena into account, we may express the ther- 
moelectromotive force as the sum of the voltage drop in the first 


, 
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wire, the jump in potential from the first wire to the second, the 
potential drop in the second wire, and the jump in potential from 
the second wire to the starting point of the circuit: 


Ta Ty 
é= \ ay aT + [yyy (T2) — G1 (1 2)] + \ ayy AT + [1 (21) — Gui (11))- 
T Ta 


To simplify the above expression, let us write the dilference 
(py (Ls) — gir (71) in the form 


and the analogous second difference in a similar form. Now, the for- 
mula for the emf assumes the form 


qT d¢ of dy 
os \ — dT — § (an — pt) ar. 
T1 Ty 


Thus, we have succeeded in expressing ¢ in the form of a difference 
between two quantities, each of which is characteristic of a given 
body. Quite often the term “thermoelectromotive force” is used to 
refer Lo the emf per degree: 
dp 
a=—a WT? 

rather than to the above integral. This quantity is a fundamental 
characteristic of the thermoelectric properties of a body. It is not 
an invariable constant, for it may depend on thermodynamic condi- 
tions, including the temperature. However, for many bodies this 
dependence is not well defined. 

By measuring the thermoelectromotive force, we may determine 
the difference between the above quantities, but we cannot deter- 
mine a. However, by forming different pairs of conductors and semi- 
conductors, we are able to determine the value of a@ relative to 
a material taken as a “base”. Thus, materials may be arranged in 
a series in accordance with their thermoelectromotive forces. For 
reasons which are quite understandable in’ view of what has been 
said, a thermoelectromotive force series does not coincide with the 
corresponding contact potential series. 


Let us list the thermoelectromotive forces of several metals with respect to 
plat’uym. If a given metal is joined to platinum and one junction is held at 
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0°C while the other is held at 100°C, an emf arises in the closed circuit: 


Antimony -—- 4.0 mv 
Iron SEOs, 22 
Copper -~- 0.73 * 
Nickel —-1.5 7” 


Constantan —3.4 ” 


The positive sign indicates that at the 0°-junction current flows from the 
given metal to the platinum. 


Using a table of values of the constant a, one can calculate the 
thermoelectromotive force occurring for a given temperature differ- 
ence from the expression @ = (4, — @2) (J; — T:2). This is the 
expression for a thermal element consisting of two metals or two 
semiconductors the current carriers of which have the same sign. 
In this case, the potential differences arising in the two branches of 
the circuit are in phase opposition, and the resulting emf is equal 
to the difference between the effects of the two conductors forming 
the circuit. However, the picture changes when the circuit is formed 
of two semiconductors, one of which possesses hole conduclivity 
and the other electron conductivity. In a p-type conductor, holes 
move toward the cold junction and electrons toward the hot junc- 
tion. In an n-type conductor, electrons move toward the cold junc- 


ae The two effects reinforce each other and the formula assumes 
the form. 


6 = (44 +g) (T4—T2). 


This fact is of great practical significance. 


279. Liberation of Heat in Electrical Circuits 


Joule heat is liberated in a conductor in which current flows. 
The displacement of charges along a body is accompanied by two 
‘other thermal effects. 

The first of these, the Peltier effect, consists in the following. If 
electric current passes through a junction between two bodies, heat 
proportional to the current strength is released or absorbed at the 
junction: 


Q=Il, 


where II is a proportionality constant. A remarkable feature of this 
effect is that the sign and magnitude of the heat changes when the 
direction of the electric current changes, i.e., depending on this 
direction, a particular junction will release or absorb heat. This 
was demonstrated by Lenz more than a century ago. A drop of water 
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was placed in a recess at the junction between an antimony rod and 
a bismuth rod. Then, by passing current in one direction, he showed 
that the drop freezes. When the current was reversed, the drop 
melted. 

The second effect occurs in any uniform conductor which is heated 
nonuniformly. Assume, for example, current flows along a rod, one 
end of which is maintained at one temperature and the other end 
at another. In such a conductor additional heat, ~ Cold 
proportional to the first power of the current a 
strength (not to the second power as in the case 
of the Joule effect), will be liberated. To detect 
this effect, we must, of course, reduce Joule heat 
to a minimum. 

This effect, which was predicted by the British 
physicist Thomson on the basis of thermodynamic | 
considerations, may be demonstrated in the fol- 
lowing manner. Included in the current circuit H C 
are two bars which are placed parallel to each 
other as shown in Fig. 309. The ends of these 
bars, maintained in pairs, are held at different 
temperatures. It would seem, in view of the sym- 
metry of the arrangement, that symmetrical points 
of the bars should have the same temperature. 

However, in one bar’ the current flows from the 

hot end to the cold end and in the other from Hat 

the cold end to the hot. Owing to the Thomson 

effect, corresponding points of the two bars do 

not have the same temperature. A point of the Fig. 309 

bar in which the current flows from the hot to the 

cold end will be holter than the corresponding point of the other bar. 

The quantity of heat liberated per second in a segment of length 
dx may be written in the form j 


oT 
dQ=tI = dz, 


where t is a proportionality constant. The greater the temperature 
gradient, the greater the quantity of heat. Three effects exist simul- 
taneously in a thermoelectrical circuit: the appearance of a thermo- 
electromotive force, the Peltier effect and the Thomson effect. On 
the basis of the principles of thermodynamics, it can be proved 
that these three processes are interconnected. This requires no proof 
for weak currents. Since the thermoelectric effects are proportional 
to the first power of the current and Joule heat to the second, the 
Joule heat is negligible in such cases. 


45* 
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Example. The ends of a rod of sodium (t = —8.5 107% v/deg and p = 
= 5 «K 107% ohm cm) of 10-cm length and 5-mm® cross-section are maintained 
at temperatures of 300°K and 310°K. 

When a current / = 0.5 ma flows from the hot to the cold end of the rod, 
the heat liberated in the conductor per unit time due to the Thomson effect is 


oT z é , ae 
Qr=tl ae l= —8.5x 10-6 (—5 % 1074) X 1K 104.25 x 108 J/sec. 
The minus sign preceding the current indicates that the current flows in the 


direction of decreasing temperature. The teat liberated in the conductor per 
unit time due to the Joule effect is 


10 é 
— /2R—(5»s ANZ we 5 xe -6 —=2,5x -10 ‘ , 
Qy=lPR= (5X 1074)2 K 5x 10 5c 1078 5x 10 J/sec 
j.e., about = of the value of the Thomson heat. 


200 
Thermodynamic analysis shows that the coefficients a, II and 


t are interrelated as follows: tat and aat, Substitut- 


ing aT for II in the first relation, we obtain: tr. The abso- 
lute value of « may be determined from these equations. 

The Peltier and Thomson effects have the same physical basis as 
thermoelectromotive force. In the final analysis, a thermoelectromo- 
tive force arises from the fact that heat flow transfers electric charges. 
Here, however, we are dealing with phenomena in which a flow of 
electric charges transfers heat. 


’ 


280. Applications of the Thermoelectric Effect 


The opportunities for the application of thermocouples as genera- 
tors of electrical energy have increased considerably in recent times. 
Metal thermocouples have a coefficient of efficiency of the order of 
0.5 per cent, but that of a semiconductor thermocouple consisting 
of a hole segment and an electron segment has already reached as 
much as 7-8 per cent. The low efficiency results from irreversible 
losses in the form of Joule heat. If Ro is the resistance of the inter- 
nal portion of the circuit and A that of the external circuit, the 
power delivered to the external resistance (useful power) will equal 


6°R 5 for any electrical circuit; here, € is the emf. Substituting 


0 
the value of the thermoelectromotive force, we obtain for the power 
of a thermocouple the expression 


a? (Ty—T2)? 


R \ 
(R+ Ro)? * 
The electromotive force of a thermocouple is of the order of several 
tenths of a volt. To obtain a voltage of 120 v, for example, thermo- 
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couples are connected in series like a battery. If heavy currents are 
required, thermocouples must be connected in parallel. 

Another important application of the thermoeffect, which has 
also become possible as the result of the development of semicon- 
ductor engineering, is the employment of semiconductors as a re- 
frigerator. 

The application of the thermoelectric effect to the measurement 
of temperature is well known and need not be discussed. 

An important and long known field of application of the thermo- 
effect is in the detection of very small amounts of heat. The opportu- 
nities in this field have increased still further as a result of the fact 
that semiconductors yield large thermoelectromotive forces. For 
such purposes, thermocouples connected in series—so-called ther- 
mopiles—are used. Every other junction of a thermopile is cooled 
and the alternate ones are heated. Thermopiles are used to measure 
power levels as low as several ergs per second. However, it is pos- 
sible to lower this limit still further, i.e., to several tenths of an 
erg per second. This is achieved by means of vacuum thermocouples, 
the thermal loses of which are reduced to a minimum. 
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